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DNA Sequencing By Mass Spectrometry Via 
Exonuclease Degradation 



5 Rackprnund Of Th^ Invention 

The fundamental role that detennining DNA sequences has for the life 
sciences is evident Its importance in the human genome project has been discussed and 
published widely [e.g. J.E. Bishop and M. Waldholz, 1991. Genome. The Story of the Most 
Astonishing Scientific Adventure of Our Time - The Attempt to Map All Genes in the 

10 Human Body, Simon & Schuster, New York]. 

The current state-of-the-art in DNA sequencing is summarized in recent 
review articles [e.g. B. Barrell, The FASEB Journal, i, 40 (1991); G.L. Trainor. Anal . Chem - 
62, 41 8 (1990), and references cited therein]. The most widely used DNA sequencing 
chemistry is the enzymatic chain termination method [F. Sanger et al., Pmr Natl, Acad. Sci. 

15 USA. 2A, 5463 (1977)] which has been adopted for several different sequencing strategies. 
The sequencing reactions are either performed m solution with the use of different DNA 
polymerases, such as the thermophilic Taq DNA polymerase [M.A. Innes, Proc . Natl . Acad . 
Sci. TISA. fi^ 9436 (1988)] or specially modified T7 DNA polymerase ("SEQUENASE") [S. 
Tabor and C.C. Richardson, AmH Sd lJSA. M, 4767 (1987)]. or in conjunction 

20 with the use of polymer supports. See for example S. Stahl et al., Nlirlfilff Acids Rcs.. 16, 
3025 (1988); M. Uhlen, PCT AppUcation WO 89/09282; Cocozza et al., PCT Application 
WO 91/1 1533; and Jones et al., PCT Application WO 92/03575, incorporated by reference 
herein. 

A central, but at the same time limiting, part of ahnost all sequencing 
25 strategies used today is the separation of the base-specifically terminated nested fiagment 
families by polyaciylamide gel lelectrophoresis (PAGE). This method is time-consuming 
and error prone and can result in ambiguous sequence determinations. As a consequence of 
the use of PAGE, highly experienced personnel are often required for the interpretation of the 
sequence ladders obtained by PAGE in order to get reliable results. Automatic sequence 
30 readers very often are unable to handle artifacts such as "smiling", compressions, faint ghost 
bands, etc. This is true for the standard detection methods employing radioactive labeling 
such as 32p, 33p or 35s, as well as for the so-caUed Automatic DNA Sequencers (e.g. 
Applied Biosystems, Millipore, DuPont, Pharmacia) using fluorescent dyes for the detection 

of the sequencing bands. 
3S Apart fiom the time factor tfie biggest limitation of all methods involving 

PAGE as tin integral part, however, is tiie generation of reliable sequence information, and 
the transformation insert of this information into a conqniter format to fadlitate sophisticated 
analysis of the sequence data utilizing existing software and DNA sequence and protem data 
banks. 
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With standard Sanger sequencing 200 to 500 bases of unconfirmed sequence 
information could be obtained in about 24 hours; with automatic DNA sequencers this 
number could be multiplied by approximately a factor of 10 to 20 due to processing several 
samples simultaneously. A further increase in throughput could be achieved by employing 
5 multiplex DNA sequencing [G. Church et al., SdSDSfi, m 185-188 (1988); Koster et al.. 
Nucleic Affirfs Res Svmpnsiiim Ser. No. 24 . 318-21 (1221)] in which, by using a unique tag 
sequence, several sequencing ladders could be detected one after the other from the same 
PAGE after blotting, UV-crosslinking to a membrane, and hybridizations with specific 
complementary tag probes. However, this approach is still very laborious, often requires 
] 0 highly skilled personnel and can be hampered by the use of PAGE as a key element of the 
whole process. 

A large scale sequencing project often starts wift either a cDNA or genomic 
library of large DNA fr^ments inserted in suitable cloning vectors such as cosmid, plasmid ^ 
(e.g. pUC), phagemid (e.g. pEMBL, pGEM) or single-stranded phage (e.g. M13) vectors [T. 

15 Maniatis, E J. Fritsch and J. Sambrook (12S2) Molecailar Cloning. A Laboratory Manual. 
Cold Spring Haibor Laboratory, Cold Spring Harbor, NY.; Msthods in F.n?!YinQlPgY» Vol. 
101 (1983), RecomWnaiit DNA, Part C; Vol. 152 (1987), Recombinant DNA, Part D; Vol. 
154 (1987), Recombinant DNA, Part E; Vol. 15S (1987), Recombinant DNA, Part F and Vol. 
152 (1987), Guide to Molecular Cloning Techniques, Academic Press, New York]. Since 

20 large DNA ftagments currently cannot be sequenced directly in one run because the Sanger 
sequencing chemistry allows only about 200 to 500 bases to be read at a time, the long DNA 
fragments have to be cut into shorter pieces which are separately sequenced. In one approach 
this is done in a fully random manner by using, for example, unspecific DNAse I digestion, 
frequently cutting restriction enzymes, or sonification, and sorting by electrophoresis on 

25 agarose gels [Methods in FnTvmnlogv. auaaj. However, this method is time-consuming and 
often not economical as several sequences are sequenced many times until a contiguous DNA 
sequence is obtained. Very often the ejqwnditure of woric to close the ffps of the total 
sequence is enormous. Consequently, it is desirable to have a method which allows 
sequencing of a long DNA fragment in a non-random, i.e., direct, way from one end through 

30 to the other. Several strategies have been proposed to achieve this [Methfiris of EnT^YmologY. 
supm: S. Henikoflf, QsDS, 2&. 351-59 (1984); S. Hcnikoff, et al.US Patent No. 4,843,003; and 
PCT Application WO 91/12341]. However, none of the currently available sequencing 
methods provide an acceptable method of sequencing megabase DNA sequences in either a 
timely or economical manner. The naain reason for this stems from the use of PAGE as a 

35 central and key element of the overall process. 

In PAGE, under denaturing conditions the nested femilies of terminated DNA 
fragments are separated by the different mobilities of DNA chains of different length. A 
closer inspection, however, reveals that it is not the chain length alone vAdch governs the 
mobility DNA chains by PAGE, but there is a significant influence of base composition on 
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the mobility [R. Frank and H. KOster. Nticigic Acids Res., fi, 2069 (1 979)]. PAGE therefore 
is not only a very slow, but also an unreliable method for the determination of molecular 
weights, as DNA fragments of the same length but different sequence/base composition could 
have different mobilities. Likewise, DNA sequences which have the same mobility could 
5 have different sequence/base compositions. 

The most reliable way for the determination of the sequence/base composition 
of a given DNA fragment would therefore be to correlate the sequence with its molecular 
weight. Mass spectrometry is capable of doing this. The enormous advantage of mass 
spectrometry compared to the above mentioned methods is the speed, which is in the range of 

10 seconds per analysis, and the accuracy of mass determination, as well as the possibility to 
directly read the collected mass data into a computer. The application of mass spectrometry 
for DNA sequencing has been investigated by several groups [e.g. Methods in EnZYmologV. 
Vol. 193: Mass Spectrometry, (J-A. McCloskey, editor), 122Q, Academic Press, New York; 
K.H. Schramm Riomedical Ap plications of Mass SpCCtrQmetrY> 203-287 (1990); P.F. 

15 Grain Mass Spectrometry Reviews. 2, 505 (1990)]. 

Most of the attempts to use mass spectrometry to sequence DNA have used 
stable isotopes for base-specific labeling, as for instance the four sulfur isotopes 32s, 33s, 
34sand36s. See, for example, Brennanetal.,PCr Application WO 89/12694, R^L.Mi^^^ 
United States Patent No. 5,064,754, United States Patent No. 5,002,868, Jacobson et al.; 

20 Hean European Patent Application No. Al 0360676. Most of these methods employ the 
Sanger sequencing chemistry and - which jeopardizes to some extent the advantages of mass 
spectrometry • polyacrylamide gel electrophoresis vsdth some variations such as capillary 
zone electrophoresis (CZE) to separate the nested terminated DNA fragments prior to mass 
spectrometric analysis. 

25 One advantage of PAGE is the property of being a parallel method, i.e,, 

several samples could be analyzed simultaneously (though this is not true for CZE v/Uch is a 
serial method), whereas mass spectrometry allows, in general, only a serial handling of the 
samples. In a US Patent Application No.08/001,323, mass spectrometric DNA sequencing is 
proposed without the use of PAGE, employmg desoiption/ionization techniques applicable to 

30 larger biopolymers such as electrospray (ES) [J.B. Fcnn ct al., J. Phvs, Chcm-, 4451-59 
(1984); Fenn et al., PCT Application No. WO 90/14148; and B. Aidrey, SpsctTOSCQPY 
Europe. 4, 10-18 (1992)] and matrix assisted laser desoiption/ionization (MALDI) mass 
spectrometiy [F. Hillenkamp et al.. Laser Desorption Mass Spectrometiy, Part I: Mechanisms 
and Techniques and Part II: Performance and Application of MALDI of Large Biomolecules, 

35 m Mass Spectmmetrv in the Ri'nlng ical Sciences: A Tutorial (M.L. Gross, editor), 1 65-1 97 
(1222), Kluwer Academic Publishers, The Netherlands] which can facilitate determination of 
DNA sequences by direct measurement of the molecular masses in the mixture of base- 
specifically terminated nested DNA firagments. By integrating tfie concept of multiplexing 
through the use of mass-modified nucleoside triphosphate derivatives, the serial mode of 
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analysis typical for current mass spectrometric methods can be changed to a parallel mode 
. [H. KOster, US Patent Application No. 08/00U23, suuffil. 

MALDI and ES mass spectrometry are in some aspects complementary 
techniques. While ES, using an atmospheric pressure ionization interface (API), can 
5 accommodate continuous flow streams from high-performance liquid chromatoghraphs 
(HPLC) [K.B. Tomer, et al. M>,.c Sne^ctrnmetrv. 2Q, 783-88 (1991)] and capillary 

zone electrophoresis (CZE) [R.D. Smith et al., Anal. Chem-. fid. 436-41 (1 988)] this is 
currently not available for MALDI mass spectrometry. On the other hand, MALDI mass 
spectrometry is less sensitive towards buffer salts and other low molecular weight 

10 components in the analysis of larger molecules with a TOP mass analyzer [HiUenkamp et al. 
(1222), supra]; in contrast ES is very sensitive towards by-products of low volatility. While 
the high mass range in ES mass spectrometry is accessible through the fonnation of multiply 
charged molecular ions, this is achieved in MALDI mass spectrometry by applying a time-of- 
flight (TOP) mass analyzer and the assistance of an appropriate matrix to volatilize the 

15 biomolecules. Similar to ES, a thermospray interfiwe has been used to couple HPLC on-line 
with a mass analyzer. Nucleosides originating from enzymatic hydrolysates have been 
analyzed using such a configuration [C.G. Edmonds et al. Nuclelc Acids RC5., 12, 8197-8206 
(1985)]. However, Edmonds et al, does not disclose a mefliod for nucleic acid sequencing. 

A complementary and completely different approach to determine the DNA 

20 sequence of a long DNA fragment would be to progressively degrade the DNA strand using 
exonucleases from one side - nucleotide by nucleotide. This method has been proposed by 
Jett et al. See J.H. Jett et al. t Rmmnieeular Stnirtiirf ^, Dynamics. 1, 301-309 ( 1 989); and 
J.H. Jett et al. PCT Application No. WO 89/03432. A single molecule of a DNA or RNA 
fragment is suspended in a moving flow stream and contacted with an exonuclease which 

25 cleaves off one nucleotide after the other. Detection of the released nucleotides is 

accomplished by specifically labeling the four nucleotides with four different fluorescent 
dyes and involving laser induced flow cytometric techniques. 

However strategies whidi use a stepwise enzymatic degradation process can 
suffer from the problem that this process is difficult to ^mchronize, i.e.. the enzymatic 

30 reaction soon comes out of phase. Jett etal..aupa have attempted to address this problem 
by degrading just one single DNA or RNA molecule by an exonuclease. However, this 
approach is very hard, as handling a single molecule, keeping it in a moving flow stream, and 
achieving a sensitivity of detection which cleariy identifies one single nucleotide are only 
some of the very difiBcult technical problems to be solved. In addition, in using fluorescent 

35 tags, the physical detection process for a fluorescent si^ involves a time factor difficult to 
control and the necessity to en^loy excitation by lasers can cause photo-bleaching qf the 
fluorescent signal. 

The invention described here addresses most of the problems described above 
inherent to currently existing DNA sequencmg processes and provides chemistries and 
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systems suitable for high-speed DN A sequencing a prerequisite to tackle the human genome 
(and other genome) sequencing projects. 

.<;iiniin5irv Of The Invention 
5 In contrast to most sequencing strategies, the process of this invention does 

not use the Sanger sequencing chemistry, polyacrylamide gel electrophoresis or radioactive, 
fluorescent or chemiluminescent detection. Instead the process of the invention adopts a 
direct sequencing approach, beginning with large DNA fragments cloned into conventional 
cloning vectors, and based on mass spectrometric detection. To achieve this, the DNA is by 

10 means of protection, specificity of enzymatic activity, or immobilization, unilaterally 

degraded in a stepwise manner via exonuclease digestion and the nucleotides or derivatives 
detected by mass spectrometry. 

Prior to this enzymatic degradation, sets of ordered deletions can be created 
which span the whole sequence of the cloned DNA fragment. In this manner, mass-modified 

15 nucleotides can be incorporated using a combination of exonuclease and DNATRNA 

polymerase. This enables either multiplex mass spectrometric detection, or modulation of 
the activity of the exonuclease so as to synchronize the degradative process. In another 
embodiment of the invention the phasing problem can be resolved by continuously applying 
small quantities of the enzymatic reaction mixture onto a moving belt with adjustable speed 

20 for mass ^)ectrometric detection. In yet another embodiment of the invention, the 

throughput could be further increased by applying reaction mixtures firom different reactors 
sunultaneously onto the moving belt In this case the different sets of sequencing reactions 
can be identified by specific mass-modifying labels attached to the four nucleotides. Two- 
dimensional multiplexing can finther mcrease the throughput of exonuclease mediated mass 

25 spectrometric sequencing as being described in this invention. 
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Rppf ne^scription Of The Drawings 

The accompanying drawings form a part of the specifications and serve the 
purpose of providing examples illustrating certain embodiments of the present invention. 
Together with the description they help to explain the principles of the invention without 
5 limiting the scope of it. 

FIGURE 1 illustrates a process of exonuclease sequencing beginning with a 
single-stranded nucleic acid. 

FIGURE 2 illustrates a process similar to FIGURE 1, however, starting with a 
target nucleic acid inserted into a double-stranded vector. 
10 FIGURE 3 illustrates a method for introducing mass-modified nucleotides into 

a target nucleic acid sequence multiplexing mass spectrometry. 

FIGURE 4A and 4B illustrate methods for introducing mass-modified 
nucleotides into a target nucleic acid sequence multiplexing mass spectrometry. 

FIGURE 5 shows positions within a nucleic acid molecule vAdch can be 
1 5 modified for the introduction of discriminating mass increments or modulation of 
exonuclease activity. 

FIGURE 6 illustrates various structures of modified nucleoside triphosphates 
usefiil for the enzymatic incorporation of mass-modified nucleotides into the DNA or RNA 
to be sequenced. 

20 FIGURE 7 shows some possible functional groups (R) usefiil to either mass- 

modification of nucleotides in discrete increments for differentiadon by mass spectrometry 
and/or to modulate the enzymatic activity of an exonuclease. 

FIGURE 8 illustrates some linkmg groups X for the attachment of the mass 
modifying functionality R to nucleosides. 
25 FIGURE 9 is a schematic drawing of a sequencing reactor system. 

FIGURE 1 0 is a graphical representation of idealized output signals following 
the time-course of the stepwise inass spectrometric detection of the exonucleolytically 
released nucleotides. 

FIGURE 1 1 illustrates specific labels introduced by mass modification to 
30 facilitate multiplex exonuclease mass spectrometric sequencing. 

FIGURE 12 is a schematic drawing of a moving belt apparatus for delivering 
single or multiple tracks of exonuclease samples for laser induced mass spectrometric 
sequence determination in conjunction with the sequencing reactor of FIGURE 9. 

FIGURE 1 3 is a schematic representation of individually labeled signal tracks 
35 employed in multiplex exonuclease mediated mass spectrometric sequencing. 

FIGURES 1 4 A and 1 4B illustrate a method for double-stranded exonuclease 
sequencing for mass spectrometric sequence determination. 
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ji^tailpA npsfriptinn Of The Invention 

The starting point for the process of the invention can be, for example, DNA 
cloned from either a genomic or cDNA library, or a piece of DNA isolated and amplified by 
polymerase chain reaction (PCR) which contains a DNA fragment of unknown sequence. 
5 Libraries can be obtained, for instance, by following standard procedures and cloning vectors 
[Maniatis, Fritsch and Sambrook (1282), supii; Methods in RnTymolPgY. Vol. 101 QSH) 
and Vol. 152-155 (1282), supra] . Appropriate cloning vectors are also commercially 
available. As will be apparent, the invention is not limited to the use of any specific vector 
constructs and cloning procedures, but can be applied to any given DNA fragment whether 
10 obtained, for instance, by cloning in any vector or by the Polymerase Chain Reaction (PCR). 
The unknown DNA sequence can be obtained in either double-stranded fonh using standard 
PCR or in a smgle-stranded form employing asymmetric PCR [PCR Technpiogv: Principles 
and Applications for DNA Amplification (Erlich, editor), M. Stockton Press, New York 
0282)]. 

15 For those skilled in the art it is clear that both DNA and RNA can be 

exonucleolytically degraded from either the 5* or 3' end depending upon the choice of 
exonuclease. Similarly the sequence of an unknown DNA molecule can be detennined 
directly by exonuclease digestion, or alternatively, the DNA fragment of unknown sequence 
can be transcribed first into a conqilementaiy RNA copy which is subsequently 

20 exonucleolytically degraded to provide the RNA sequence. Appropriate vectors, such as the 
pGEM (Promega) vectors, are useful in the present as they have specific promoters for either 
the SP6 or T7 DNA depaidttit RNA polymerases flanking the multiple cloning site. This 
feature allows transcription of both unknown DNA strands into complementary RNA for 
subsequent exonuclease sequencing. Furthennore, these vectors, belonging to the class of 

25 phagemid vectors, provide means to generate single stranded DNA of the unknown double 
stranded DNA. Thus, by using two vectors which differ only in the orientation of the fl 
origin of replication, both strands of the unknown DNA sequence can be obtained in a single 
stranded fomi and utilized for subsequent exonuclease sequencing. The scope of tiie 
invention is also not Umited by the choice of restriction sites. There is, however, a preference 

30 for rare cutting restriction sites to keep the unknown DNA fragment unfragmented during the 
manipulations in preparation for exonuclase sequoicing. The following desoiption and the 
FIGURES do serve the purpose by way of examples to illustrate the principles of the 
invention. Variations which are withm the scope of the invention will be obvious to those 
skilled in the art Various mass spectrometric configurations are wiAin the scope of the 

35 invention: For desorption/ionization e.g. fest atomic bombardment (FAB), plasma desorption 
(PD), thermospray (TS) and preferentially 

electrospray ^S) and laser desorption widi and without an appropriate matrix (LD or 
MALDI); as mass analyzer e.g. a time-of-flight (TOF) configuration or a quadrupole is 
applicable. 
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(i) Preparation of unknown nucleic acid sequence for exonuclease sequencing: 

FIGURE 1 describes the process for a single stranded DNA insert ("Target 
5 DNA") ofa single stranded circular cloning vector. The boundaries of the target DNA are 
designated A and B. The target DNA, as illustrated in FIGURE 1, has been cloned into the 
Not I site ofa vector. A synthetic oligodeoxynucleotide [N.D. Sinha, J. Biemat, J. McManus 
and H. Kfister, N.irleic Acids Res .. 12, 4539 (1984)] which will restore the Not I site to 
double-strandedness and which is complementary to the vector sequence flanking the A 

10 boundary of the insert DNA is hybridized to that site and cleaved by Not I restriction 

endonuclease. The two pieces of the synthetic oligodeoxynucleotide can then be removed by 
molecular sieving, membrane filtration, precipitation, or other standard procedures. 

FIGURE 1 also illustrates a set of ordered deletions (tO t2, t^) which can 
be obtained by the time-limited action of aii exonuclease, e.g. T4 DNA polymerase, in the 

15 absence of dNTPs. The set of deletions can be umnobilized on a solid s»q»portTr(Trt>,Tr^ 
Ti2, Tr3), or alternatively, tiie set of ordered deletions can be obtained in a heterogeneous 
reaction by treating tiie solid support Tr^ containing the complete target DNA sequence with 
an exonuclease in a time-limited manner. In tiie instance where the 3' termini of each time 
point are too heterogeneous (i.e., "fazzy") to be analyzed directiy 1^ exonuclease mediated 

20 mass spectrometric sequencing, circularization of tiie template and a cloning step can be 
performed prior to this sequencing process with single transformed colonies selected. 

A single stranded linear DNA fragment carrying tiie unknown sequence witii 
its A boundary at tiie 3' end can be directiy sequenced by an 3' exonuclease in an apparatus 
described below and schematically depicted in HGURE 9, provided tiiat tiie exonuclease is 

25 immobilized witiun tiie reactor on, for wcample, beads, on a fiit, on a membrane located on 
top of tiie frit or on the glass walls ofa cqiillary or entnqiped in a gel matrix or simply by a 
semipermeable membrane which keeps tiie exonuclease in tiie reactor whUe tiie linear DNA 
fragment is circulating tiirough a loop. 

At time intervals, or alternatively as a continuous stream, tfie reaction mixture 

30 containing tiie buffer and tiie released nucleotides is fed to tiie mass spectrometer for mass 
determination and nucleotide identification. In anotiier embodiment, tiie stream containing 
tiie nucleotides released by exonuclease action can be passed tiirough a second reactor or 
series of reactors whidi cause tiie released nucleotide to be modified. For example, tiie 
second reactor can contain an immobilized alkaline phosphatase and tiie nucleotides passing 

35 tiieretiuough are transformed to nucleosides prior to feeding into the mass spectrometer. 
Other mass-modifications are described below. 

In general, when it is tiie released nucleotide (or ribonucleotide) vMch it 
mass-modified, tiie modification should take as few steps as possible and be relatively 
efBcient For example, reactions used in adding base protecting groins for oligonucleotide 



wo 94/21822 



-9- 



PCT/US94/02938 



synthesis can also be used to modify the released nucleotide just prior to mass spectrometric 
analysis. For instance, the amino function of adenine, guanine or cytosine can be modified 
by acylation. The ammo acyl function can be, by way of illustration, an acetyl, benzoyl, 
isobutyryl or anisoyl group. Benzoylchloride, in the presence of pyridine, can acylate the 
5 adenine amino group, as well as the deoxyribose (or ribose) hydroxyl groups. As the 
glycosidic linkage is more susceptible to hydrolysis, the sugar moiety can be selectively 
deacylated if the acyl reaction was not efficient at those sites (i.e., heterogeneity in molecular 
weight arising from incomplete acylation of the sugar). The sugar moiety itself can be the 
target of the mass-modifying chemistry. For example, the sugar moieties can be acylated, 

1 0 trity lated, monomethoxytritylated, etc. Other chemistries for mass-modifying the released 
nucleotides (or ribonucleotides) will be ^parent to those skilled in the art. 

*CIn another embodiment the linear single stranded DNA firagment can be 
anchored to a solid support. This can be achieved, for example, by covalent attachment to a 
functional group on the solid support, such as through a specific oligonucleotide sequence 

1 5 which involves a spacer of sufficient length for the ligase to react and which is covalently 
attached via its 5' end to the support (FIGURE I). A splint oligonucleotide with a sequence 
complementary in part to the solid bound oligonucleotide and to the 5' end of the linearized 
single stranded vector DNA allows covalent attachment of the DNA to be sequenced to the 
solid siqjport. After annealing, ligation (i.e., with T4 DNA ligase) covalently links the soUd 

20 bound oligonucleotide and the DNA to be sequenced. The splint oligonucleotide can be 
subsequently ranoved by a temperature jump and/or NaOH treatment, or washed off the 
support using other standard procedures. The solid support with the linear DNA example is 
transferred to the reactor (FIGURE 9) and contacted with an exonuclease in solution. As 
illustrated, where the 3' end of the unknown DNA fiagment is exposed (ie unprotected), a 3' 

25 exonuclease is employed. The released nucleotides, or modified nucleotides if intermediately 
contacted with a modifying agent such as alkaline phosphatase, are identified by mass 
spectrometry as decribed above. Other linking groups are described herein, and still others 
will be apparent to those skilled in the art from the invention described here. 

The solid siqiports can be of a varieQr of materials and shapes, such as beads 

30 of silica gel, controlled pore glass, cellulose, polyacrylamide, sepharose, sephadex, agarose, 
polystyrene and other polymos, or membranes of polyethylene, polyvinylidendifluoride 
(PVDF) and the like. The solid siq>ports can also be cq)illaries, as well as fiits finom glass or 
polymers. Various 3' exonudeases can be used, such as phosphodiesterase fiom snake 
venom, Exonuclese VH from E. coli, Bal 3 1 exonuclease and the 3'-5' exonuclease activity of 

35 some DNA polymerases exerted m the absence of dNTPs, as for example T4 DNA 

polymerase. 

In using a phagemid vector with an inverted fl origin of replication, the B 
boundary would be located at the 3' end of the immobilized linear single stranded DNA and 
exposed to exonuclease sequencing using the same restriction endonuclease, hybridizing 
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oligodeoxynucleotide and splint oligonucleotide. As another embodiment of this invention, 
the hybridizing oligonucleotide can also be designed to bind a promoter site upstream of the 
A boundary and by doing so restoring the doublestranded promoter DN A. Directly, or with a 
short initiator oligonucleotide carrying an attachment functionality at the 5' end, transcription 
5 can be initiated with the appropriate specific DNA dependent RNA polymerase [Methods in 
Enzvmolopv , Vol. 185, Gene Expression Technology (152fi); J F. Milligan, D.R. Groebe, 
G.W. Witherell and O.C. Uhlenheck. Nucleic Acids Res .. 15. 8783-98 (1987); C. Pitulle, 
R.G. Kleineidam, B. Sproat and G. Krupp, Iifilie, 112, 101-105 (1992) and H. Koster US 
Patent Application No. 08/001,323, supii]. The RNA transcript can be transferred to the 

1 0 reactor (FIGURE 9) and contacted with an immobilized or otherwise contained exonuclease, 
or immobilized via the 5' functionality of the initiator oligonucleotide incorporated in the 
RNA transcript to a solid support and then contacted with an exonuclease in solution. 

Depending on the length of the DNA insert (i.e., number of nucleotides 
between boundary A and B in FIGURE 1) the mass spectrometric exonuclease sequencing 

15 process can allow the complete sequence from A to B to be determined in one run. 

Alternatively, prior to exonuclease sequencing, a set of ordered deletions can be prepared 
accoiding to standard procedures [e.g. Methods in RnTvmologv. Vol 101 (12S1) and Vol. 
152-155 (12SZ); R.MJC Dale et al., Elasmid, 12, 31-40 (1985)], such that, in HGURE 1 the 
steps Tr^ to Tr^ can represent either different time values of the mass spectrometric 

20 exonuclease sequencing reaction from inmiobilized DNA fragments or different starting 
points for the exonuclease DNA/RNA mass spectrometric sequencing process. In either case 
the principle of the invention described provides a process by which the total sequence of the 
insert can be determined. 

In another embodiment of the invention the unknown DNA sequence (target 

25 DNA) is inserted into a double-stranded cloning vector (FIGURE 2) or obtained in double- 
stranded fomi, as for example by a PCR (poljonerase chain reaction) process [£CE 
Technology. (1989^ supra] . The DNA to be sequenced is inserted into a cloning vector, such 
as ligated into the Note I site as illustrated in Figure 2. Adjacent to the A boundaiy there can 
be located another cutting restriction endonuclease site, such as an Asc I endonuclease 

30 cleavage site. The double-stranded circular molecule can be linearized by treatment with Asc 
I endonuclease and ligated to a solid support using a splint oligodeoxynucleotide (and ligase) 
as described above which restores the Asc I restriction site (Tr^ds and Tr^'ds). The strand 
which is not immobilized can be removed by subjecting the double stranded DNA to standard 
denaturing conditions and washing, thereby generating single-stranded DNAs immobilized to 

35 the solid support (Tr^ss and Tr^'ss). Since the unknown double-stranded DNA sequence can 
be ligated in either orientation to the support there can exist two non-identical 3* termini (+ 
and - strand) immobilized, which can result in ambiguous sequencing data. The inmiobilized 
fragment which carries the vector DNA sequence at the 3* end (Tr® ss) can be protected from 
3* exonuclease degradation during the sequencing process by, for example, annealing with an 
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oligodeoxynucleotide complementary to the 3' end of the strand to be protected. As there can 
only be hybridization at one 3' terminus, i.e., to the wrong single-stranded DNA with (-) 
strand information (TrO'ss), some alpha-thio dNTP's can be incorporated into the immobilized 
(-) strand via treatment with a DNA polymerase and completely protect that strand from 
5 exonucleolytic degradation [P.M.J. Burgers and F. Eckstein, PiochcmiStrY, Ifi, 592 (1 979); S. 
Labeit. H. Lehrach and R.S. Goody, DM, 1 173 (1986); S. Labeit, H. Lehrach and R.S. 
Goody in MAthnH^ in FnTvmnlngv. Vol. 155, page 166 (12S2), supra]. If desired, after 
incorporation of exonuclease resistant nucleotides, the oligonucleotide primer may be 
removed by a washing step under standard denaturing conditions. The immobilized single- 

10 stranded DNAs are transferred to the sequencing reactor (FIGURE 9) and the sample with the 
unknown sequence at the 3' end is degraded by an exonuclease in a stepwise manner. The 
liberated nucleotides, or optionally modified nucleotides, are continuously fed into the mass 
spectrometer to elucidate the sequence. 

As above, where the inserted DNA is too long for determining the complete 

15 sequence information bet«reen the boundaries A and B (FIGURE 2) in one nm of 

exonuclease mass spectrometric sequencing, a series of overiapping ordered deletions.can be 
constructed according to standard procedures, e.g. utilizing the restriction ate RID producing 
3' sticky ends inert towards exonuclease m digestion (Mfitbods in RuTymologY. Vol 1S2-1SS 
(1987^ and S. Henikofif, fiene. (1284), supra]. When required, the deletion mutants can be 

20 recircularized and used to transform host cells following standard procedures. Single 
colonic of the transformed host cells are selected, further proliferated, and the deletion 
mutant isolated and immobaized as single-stranded DNAs, similar to the process described 
above and subsequently analyzed by exonuclease mass spectrometric sequencing. 
Alternatively, the immobilized fiill length single sttanded DNA (Tr^ss) can be treated in 

25 time-limited reactions with an exonuclease such as T4 DNA polymerase in the absence of 
NTPs, to create a set of immobilized ordered deletions for subsequent exonuclease mass 
spectrometric sequencing. However, in case the 3* termini are too heterogeneous for direct 
mass spectrometric exonuclease sequencing an intermediate cloning step can be included. In 
yet another embodiment, the exonuclease mediated sequencing can be performed by 

30 providing the single-stranded (ss) nucleic add fiagment in solution. This can be achieved by 
treating the solid support, e.g. T^ss, with an oligonucleotide complementary to the unique 
Asc I site. After hybridization this site is now double-stranded (ds) and susceptible to Asc I 
endonuclease cleavage and release of the single-stimded fiagment 

If a cloning vector such as one of the pGEM femily (PromegaCorp.) is used, 

35 both strands of the double-stranded target DNA can be transcribed separately dependant upon 
which of the specific promoters flankmg the insertion site is used Oocated next to the A or B 
boundary of the insert, HGURE 2) and the conesponding specific DNA dependent RNA 
polymerase (i.e., SP6 or T7 RNA polymerase). These RNA transcripts can be direcUy 
transferred to the sequencing reactor (HGURE 9) for mass spectrometric exonuclease 
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sequencing using an immobilized or entrapped exonuclease. In an alternate embodiment, the 
transcription process is initiated via initiator oligonucleotides with a 5' functionality allowing 
the subsequent immobilization of the Rl^A transcripts to a solid support (H. Kdster, US 
Patent Application No. 08/001,323, supia]; in this case the mass spectrometric sequencing 
5 can be performed within the sequencing reactor using an exonuclease in solution. The 
stepwise liberated ribonucleotides, or modified ribonucleotides (i.e., ribonucleosides 
generated by passing through a reactor containing immobilized alkaline phosphatase), are fed 
to the mass spectrometer for sequoice determination. 

10 

(ii) Introduction of mass-modified nucleotides for multiplex exonuclease 

sequencing: 

Since standard mass spectrometiy is a serial process, the throughput can be 
15 limited. However, in the present invention the throughput can be considerably increased by 
the introduction of mass-modified nucleotides into the DNA or RNA to be sequenced 
allowing for a parallel analysis of several nucleic acid sequences simultaneously by mass 
spectrometry. See H. Kfister, US Patent Application No. 0^00U23.SHPia. Low molecular 
weight nucleic acid components, such as unmodified or mass modified 
20 nucleotides/nucleosides, can be analyzed simultaneously by multiplex mass spectrometry. 

Mass-modified nucleotides can be incorporated by way of mass modified 
nucleoside triphosphates precursors using various methods. For example, one can begin with 
the insert of the target DNA sequence in a single-stranded cloning vector by having a 

"primer" and a "stopper" oligonucleotide bound to the complementary vector sequences 
25 located at the A and B boundary of the insert DNA respectively (FIGURE 3) and a temptate 
directed DNA polymerase, preferentially one lacking the 3'- 5' and 5'- 3' exonuclease activity 
such as Sequenase, version 2.0 (US Biochemicals, derived firom T7 DNA polymerase), Taq 
DNA polymeiase or AMV reverse transcriptase. In the illustrative embodiment, the 
unknown DNA sequence has been inserted in a restriction endonuclease site such as Not I. 
30 Adjacent to fte A boundary another restriction endonuclease site, such as the Asc 1 site, can 
be located within the primer binding site such that the parUy double-stranded circular DNA 
can be cleaved at the unique Asc I site and the mass-modified (-) strand (t^ in FIGURE 3) 
isolated by standard procedures (i.e., membrane filtration, molecular sieving, PAGE or 
agarose gel electrophoresis) and, if desired, coupled to a solid support via a splint 
35 oligonucleotide restoring the Asc I site in double-stranded form for Ugation by T4 DNA 

ligase (FIGURE 3). After having removed the splint oligonucleotide the immobilized single- 
stianded DNA fragment with its B' boundary at the 3' end (i.e., Tr^) is ready for exonuclease 
mediated mass spectrometric sequencing. In another illustrative embodiment, the same 
primer can be used even vAxea the vector has no complementary Asc I site. Although the 
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primer will not hybridize with its 5' terminal sequence to the vector as is shown in FIGURE 
3, it will nevertheless allow the covalent attachment of the single-stranded mass-modified 
DNA to the solid support using the same splint oligonucleotide as described above. In yet 
another embodiment the primer can carry a non-restriction site sequence information at its 5' 
5 end, which may or may not be complementary to the opposite vector sequence, but is 
complementary to a specific splint oligodeoxynucleotide which allows the covalent 
attachment, to the solid support. The latter two procedures do not require cleavage with a 
restriction endonuclease and separation of the strands. 

The reaction mixture obtained after enzymatic synthesis of the mass-modified 

10 (-) strand can be directly joined to the solid support and the circular vector DNA and the 
stopper oligonucleotide can be removed under denaturing conditions. In yet another 
embodiment, the generation of a set of ordered deletions of the target DNA sequence 
information and the incorporation of mass modified nucleotides can be combined by 
terminating the DNA polymerase reaction at different time intervals (i.e., t^, t^, t2, t^, 

15 FIGURE 3) to generate a ladder of mass-modified (-) strands. In case the 3' termini of each 
time point are too heterogeneous for mass spectrometric exonuclease sequencing a 
circuiarizadon and cloning step as described above can be included. 

As illustrated in FIGURES 14A and 14B, both the (+) and (-) stiand can be 
exonuclease sequenced simultaneously. Incorporation of mass-modified nucleotides in to 

20 one ofthe(+) or (-) strands can be carried out as described above. In the illustrative 
embodiment, both the (+) and (-) strands are ligated to solid supports and exonucleased 
simultaneously. The presence of the mass-modified nucleotides in the (-) strand can allow 
for differentiation of mass spectrometric signals arising firom nucleotides released fix)m both 
strands. Where exonuclease sequencing can proceed essentially between the A and B 

25 boundaries in one pass, the sequence of the (-) strand can be inverted and aligned with the 
complimentary sequence. An advantage to this approach is the identification of ambiguous 
sequencing data (i.e., base pair mismatches arising fi*om error of sequencing one of the 
strands). Alternatively, the full sequence can be obtained firom partial exonuclease 
sequencing of both the (+) and (-) strands provided that sequencing proceeds to an overlaping 

30 point on each strand. By searching for the complementary overlapping region of each 

sequence fragment and aligning the two sequence fragments, the sequence of the remainder 
of one or both of the strands can be "reconstructed" based on the known sequence of the 
other. This later example provides a means of seqiiencing m "one pass" a much larger DNA 
fragment then would be possible by exonuclease sequencing only one strand. 

35 In using vectors of the phagemid type (e.g. pGEM ftmily, Promega Corp.) 

both strands of the unknovm DNA fragment can be mass-modified by usmg just the vector 
which carries the fl origin of replication in the opposite direction. 

As further embodiments of this mvention and in analogy to the reactions 
described above, RNA transcripts of both strands can be obtained utilizing, for example, 

SUBSnrUTE SHEET (RULE 26) 
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transcription promoter regions flanking the insertion site, restoring the double-stranded 
promoter site by complementaiy oligonucleotides [Methods in BnzvmolQBY. Vol. 1 85, 
iimy, Uhlenbeck et al., N..rldc Acids -Res.. (1282), SUpm] and transcribing with apprppriate 
RNA polymerases in the presence of mass-modified ribonucleoside triphosphates. As above, 
5 the mass-modified RNA can be directly transferred to the sequencing reactor for mass 
spectrometric sequencing using an immobilized or entrapped exonuclease. In another 
embodiment the transcription can be initiated with initiator oligonucleotides [Krupp et al.. 
Gene. (1222), SUm] carrying a 5' functionality for subsequent attachment of the mass- 
modified RNAs to a solid support. In the later instance, the immobilized mass-modified 
10 RNAs can be contacted in the sequencing reactor (FIGURE 9) with an exonuclease in 

solution for mass spectrometric sequencing. 

The mass-modification of the immobilized strand starting with the unknown 
DNA insert in a double-stranded vector (FIGURE 4A) can be introduced starting with a 
situation similar to TrOds in HGURE 2. However, a 5' phosphorylated exonuclease III 

15 resistant splint oligonucleotide (i.e., 2'3' dideoxy) is Ugated to the (-) strand allowing a 
unilateral digestion of the (+)stiand with exonuclease III (FIGURE 4A). The mass- 
modifications are tiien introduced by a filling-in reaction using tonplate depoident DNA 
polymerases such as Sequenase, version 2.0 (US Biochemicals), taq DNA polymerase or 
AMV reverse transcriptase and appropriate mass-modified dNTPs. In anotiier embodiment 

20 one can start with a situation similar to TrOss in HGURE 2 and by using a (-) primer 

designed to bind outside tiie A boundary at tiie 3' end of the (+) strand, syntiiesize a umss- 
modified (-) strand employing mass-modified dNTPs and a DNA dependent DNA 
polymerase as described above. In one embodiment, there can be a short stretch of sequence 
between the Not I and tiie Asc I site to aUow tiiis primer to hybridize effectively. This 

25 approach can also be carried out by generating a mass modified (+) strand starting witii Tr^ ss 
(FIGURE 4B). The newly syntiiesized (+) strand can be isolated from the (-) strand solid 
support, such as by denaturation, and immobilized via the 5' sequence information of tiie 
primer and a splint oligonucleotide which is in part complementary to tiiis and to an 
oligonucleotide sequence already attached to anotiier solid support (FIGURE 4B). After 

30 ligation (i.e., witii T4 ligase) tiie spUnt oUgonucleotide is removed and tiie immobilized mass- 
modified single-stranded (+) DNA is transferred to tiie sequencing reactor (FIGURE 9) and 
contacted with an exonuclease, such as T4 DNA polymerase in solution, for mass 
spectrometric sequence determination via tiie released mass-modified nucleotides. 

In accordance witii this invention, tiie mass-modifying functionality can be 

35 located at different positions witiiin tiie nucleotide moiety OPIGURE 5 and 6). See also H. 
K6ster, US Patent AppUcation No. 08/001323, for fiirtiier examples and syntiiesis 
chemistries. For instance, tiie mass-modifying fimctionality can be located at tiie heterocycUc 
base at position C-8 in purine nucleotides (Ml) or C-8 and/or C7 (M2) in c7-deazapurine 
nucleotides and at C-5 in uracU and cytosine and at tiie C-5 metiiyl group at tiiymine residues 
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(Ml). Modifications in these positions do not interfere with Watson-Crick specific base- 
pairing necessary for the enzymatic incorporation into the mass-modified nucleic acids 
(DNA/RNA) with high accuracy. Modifications introduced at the phosphodiester bond (M4), 
such as with alpha-thio nucleoside triphosphates, have the advantage that these modifications 
5 do not interfere with accurate Watson-Crick base-pairing and additionally allow for the one- 
step post-synthetic site-specific modification of the complete nucleic acid molecule e.g. via 
alkylation reactions [K.L. Nakamaye, G. Gish, F. Eckstein and H.-P. Vossberg, Nufiifiic 
Acids Res .. If, 9947-59 (1988)]. However, this modification is not applicable where the 
exonucleolytically released nucleotides are to be treated with immobilized alkaline 

10 phosphatase intermediate release and mass spectrometric detection. Mass modification can 
also occur at the sugar moiety, such as at the position C-2' (M3). Modifications at this 
position can be introduced with the purpose of modulating the rate of exonuclease activity in 
order to synchronize the degradation process from time to time. The modification M4 can 
also serve this purpose. For example, it is known [Burgers and Eckstein. (1222), SUJJO] that a 

15 phosphodiester bond carrying a monothio fiinction is approximately 100 time less sensitive 
towards exonucleolytic degradation by exonuclease m. 

The tables in FIGURES 7 and 8 depict some exanqiles of mass-modi^ring 
functionalites for nucleotides. This list is, however, not meant to be limiting, since numerous 
other combinations of mass modifying functions and positions within the nucleotide 

20 molecule are possible and are deemed part of the invention. The mass modifying 

fimctionality can be, for example, a halogen, an azido, or of the type XR. Wherein X is a 
linking group and R is a mass modifying fimctionality. The mass modifying fimctionality 
can thus be used to introduce defined mass increments into the nucleotide molecule. 

Without limiting the scope of the invention, the mass modification, M, can be 

25 introduced for X in XR as well as using oligo-Zpolyethylene glycol derivatives for R. The 
mass modifying increment in this case is 44, i.e., five different mass modified species could 
be generated by just changing m fiom 0 to 4 thus adding mass units of 45 (nF=0), 89 (m»l), 
133 (m=2), 177 (m=3) and 221 (m=4). The oligo/polyethylene glycols can also be 
monoalkylated by a lower alkyl such as methyl, ethyl, propyl, isopropyl, t-butyl and the like. 

30 A selection of linking fimctionalities X are also iUustiated in RGURE 8. Other chemistries 
can be used in the mass modified compounds, as for exanq)le, those described recently in 
Oligonucleotides and Analogues, A Practical Approach, F. Eckstein, editor, IRL Press, 
Oxford, 1991. 

In yet anotiier embodiment, various mass modifying functionalities R, other 
35 tiian oligo/polyethylene glycols, can be selected and attached via appropriate linking 

chemistries X. A simple mass modification can be achieved by substituting H for halogens 
like F, CI, Br and/or I; or pseudohalogens such as SCN, NCS; or by using different alkyl, aryl 
or aralkyl moieties such as metiiyl, etiiyl, propyl, isopropyl, t-butyl, hexyl, phenyl, substituted 
phenyl, benzyl; or functional groiqjs such as CH2F, CHF2, CF3, Si(CH3)3, 
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Si(CH3)2(C2H5), Si(CH3)(C2H5)2, Si(C2H5)3 • Yet another mass modification can be 
obtained by attaching homo- or heteropeptides through X to the nucleotide. One example 
useful in generating mass modified species with a mass increment of 57 is the attachment of 
oligoglycines, e.g. mass modifications of 74 (r=l, m=0), 131 (i=l, m=2), 188 (r=l, m=3), 
5 245 (r=l,m=4) are achieved. Simple oligoamides also could be used, e.g. mass 

modifications of 74 (i=l, m=0), 88 (i=2,,m=0). 102 (r=3, m=0), 116 (i=4, m=0) etc. are 
obtainable. For those skilled in the art it will be obvious that there are numerous possibilities, 
for introducing, in a predetennined manner, many different mass modifying functionalities to 
the nucleotide. 

10 In yet another embodiment of this invention, the mass modifying functionality 

can be introduced by a two or multiple step process. In this case tiie nucleotide is, in a first 
step, modified by a precursor functionality such as azido, -N3, or modified with a functional 
group in which tiie R in XR is H thus providmg temporary functions e.g. but not limited to - 
OH.-NH2..NHR,.SH,.NCS,-OCO(CH2)rCOOH(r= 1.20),.NHCO(CH2)iCOOH(r= 1- 

15 20). -OSO2OH. ^0(CH2)rI (r - 1-20), -0P(0-AIkyl)N(Alkyl)2. These less bulky 
functionalities result in better substrate properties for enzymatic DNA or RNA syntiiesis 
reactions. The appropriate mass modifying functionaliQr can then be intinduced after the 
generation of tiie target nucleic acid prior to mass spectrometry and eitiier prior to 
exonuclease degradation or after release by exmudease action. 

20 

(iii) The exonuclease sequencer. 

A schematic outlay of an exonuclease sequencer is shown in FIGURE 9. The 
central part is tiie reactor S \Adcb has a coolmg/heating mantie (2) and a fiit or 

25 semipermeable membrane (4). Several flasks (R1-R5) can be dedicated to supply reagents 
such as buffer solutions, enzyme, ete. tiiiough a cooling/heating coil T. Beneath tiie reactor S 
tiiere is a capillaiy tube E in v^ch eitiier flie exonuclease or ti» nucleic acids can be 
immobaized. It is witiun tiie scope of this invention tiiat tiiere are at least two difBerent 
modes by which tiie system can be operated. In one mode, tiie nucleic acids are unmobilized 

30 on beads or flat membrane disks placed m tiie reactor S, or alternatively, immobilized on tiie 
inner surfece of tiie walls witiiin tiie capillary E. Exonuclease is added to tiie reactor in a 
controUed manner and tiie reaction mixture circulated tiirough a loop maintained at a 

carefully controlled temperatare. In a second mode, tiie exonuclease can be immobilized in 
e.g. a ciq}illary E beneath tiie reactor S or could be immobilized on beads or on a membrane 
35 or entrapped in a gel or kept in tiie reactor S by vray of a semipermeable membrane. By 
varying tiie lengtii and diameter of tiie capiUary and tiie flow rate tiirough a pumping device 
P, the contact time of the nucleic acids with the exonuclease can be varied. 

In botii process modes, aliquots can be fed eitiier continuously or m pulses to 

tiie mass qiectrometer eitiier directiy or tiirough a reactor AP which contains, for mstance, 
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immobilized alkaline phosphatase or other mass-modifying reagents. In case the liquid 
volume which is transferred to the mass spectrometer is too large, only a portion can be 
supplied while the remainder is separated from the flow stream by using a flow splitting 
device SP. Unused or excess solutions can be disposed of by the waste container W. In case 
5 the reaction mixture of the exonuclease digestion is processed via a movmg belt the liquid 
flow can be directed through this module prior to entering the mass spectrometer. 

(iv) The exonuclease sequencing process: 

10 Various exonucleases can be used, such as snake venom phosphodiesterase, 

Bal-31 nuclease, E. coli exonuclease VII, Mung Bean Nuclease, SI Nuclease, exonuclease 
and exonuclease III as well as the exonuclease activity of some DNA polymerases such as E. 
coli DNA polymerase I, the Klenow fragment of DNA polymerase I, T4 or T7 DNA 
polymerases, Taq DNA polymerase. Deep Vent DNA polymerase, and Ventf DNA 

1 5 polymerase. The activity of these exonucleases can be modulated, for instance, by shifting 
oflf the optimal pH and/or temperature range or by adding poisoning agents to the reaction 
mixture. The exonuclease activity can also be modulated by way of functional groups, such 
as at the C-2' position of tiie sugar moiety of the nucleotide building block or at the 
phosphodiester bond (i.e., M3/M4 in FIGURE S and 6). 

20 In the instance that unmodified nucleotides are detected, the masses for the 

phosphate dianion are 329209 for pdG, 313210 for pdA, 304.196 for pdT and 289.185 for 
pdC. In an idealized system the en^mMtic digestion would be initiated at all nucleic acid 
chains at the same time, and the nucleotides would be released in identical time intervals d 
and detected by their individual molecular weights one after the other by the mass 

25 spectrometor. FIGURE 1 0 illustrates the signals versus time for the sequence 5'...A-T -C-C- 
G.G-A3*. 

The influence of an activity modulating functionality M3/M4 on appropriately 
modified thymidine, T*, is also depicted. Due to the drastically reduced cleavage rate of the 
phosphodiester bond between dC and dT* the molecular mass representing the T* signal 
30 appears after a longer time interval f The significant retardation of the cleavage rate of one 
type of nucleotide results in better overall synchronization of the cazyma&c process. This 
retardative effect can be varied to a large extent by the buUdness of the modifying fimctional 
group as well as by a possible interaction with the active site of the exonuclease. 
Additionally, partial overli^ of signals can be resolved by computational methods. 

35 

(v) Multiplex exonuclease sequencing: 

A significant increase in throughput can be further obtained by employing the 
principle of multiplex exonuclease sequencing. The principle of this concept is illustrated in 
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FIGURE 11. For multiplex mass spectrometric exonuclease DNA sequencing, the DN A 
fragments to be processed in parallel can be identified by fragment specific labels introduced 
by the mass modifying functionality The target nucleic acid sequences can be mass- 
modified by using, for example, unmodified dNTPOs or NTpOs (TiO), nucleoside ^ 

5 triphosphates mass-modified with the same fimctional group, such as an additional methyl 
group at the heterocyclic base, using either dNTPl or NTPl (TrO), mass difference large 
enough to be discriminated firom the nucleotides of TrO or Trl, such as e.g. an ethyl group at 
the heterocyclic base, by employing either dNTp2 or NTP2 etc. Thus i modified DNA 
fragments can be simultaneously exonuclease sequenced. For example, the i dififerent DNA 

10 fragments can be immobilized on different membranes and a stack of such membranes placed 
into the reactor S (FIGURE 9) for simultaneous exonuclease mass spectrometric sequencing. 
Since the individual molecular weights of the various mass-modified four nucleotides are 
known in advance, the mass spectrometrically detected nucleotide masses can be easily 
assigned to the patent nucleic acid fragments and thus several sequences can be detected 

15 simultaneously by the mass spectrometer. Even in the worst case when the same nucleotide, 
e.g. dT is at the same position in all sequences, processed in parallel Ae signal can be 
decoded due to the difference in mass between dT^, dTl, dT2, dT^, dP. 

The synchitmization of exonuclease action can be improved by incorporating 
-modified nucleotides (modified at C-T or at the phosphodiester bond) into the otherwise 

20 unmodified or mass-modified nucleic acid fiagments as set out above. In particular such a 
mass-modified nucleotide can also be introduced at the 3' end of the single stranded nucleic 
acid fragment (i.e., C-2' or phosphodiester bond modifications) to achieve a more uniform 
initiation of the exonuclease reaction. 

In yet another embodiment of this invention, a reduction of overiap of 

25 neighboring signals can be achieved by using a moving belt device as schematicaUy shown m 
nOURE 12. In a recent pubUcation a moving belt has been described although in a 
completely unrelated application [MMoini and F.P. Abramson. Biftlngical Mass 
Spectrnmetrv. 2fl, 308-12 (1 991)]. Tlw effect of the moving belt can be to help spread the 
appearance of sequentiaUy released nucleotide/nucleoside signals as iUustrated m HGURE 

30 13. The width between consecutive, i.e.,ndghboring, signals can be inoeased with the s^ 

of die moving belt 

Without limiting the scope of the mvention HGURE 12 Ulustiates a possible 
configuration of die moving belt module for exonuclease mediated mass spectrometric 
sequencing. An endless metal ribbon (10) is driven witii variable speed using a controllable 
35 stepping motor and appropriately positioned pulleys. Spring-loaded pulleys can be used to 
maintain a sufBcientiy high tension on the moving belt. The sample is ^lied to the belt 
from die reactor module A (FIGURE 9) at a position which is in direct contact with a 
coolmg/heating plate B. In case matrix-assisted laser desoiption/ionization mass 
spectrometry is employed the sample can be mbced witii a matrix solutionM prior to loading 
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onto the belt. Crystal formation can be observed with a viewing device D (CCD camera and 
optics). Alternatively the container M can be used to mix the sample with a diluent or other 
reagent, en2yme, internal standard etc. at a mixing valve or vortex (12). In the instance of 
relatively small molecules such as the released nucleotides, matrix is not essential for the 
5 laser desorption/ionization process to take place. The belt 1 0 moves the sample under a laser 
source E (with appropriate optics) for desorption and ionization. 

A heating element C, such as a microwave source, can be placed near the 
surface of the returning belt, separated from the forward moving belt by an insulating shield 
I, to clean the surface of the metal ribbon belt 10 of any organic material prior to loading a 

10 new sample. Alternatively, a washing station W can be integrated before the heating element 
C; in this case the function of the heating element C can be completely dry the metal ribbon 
prior to reloading of sample. Before and after the laser targets the sample, two differential 
vacuum pumping stages F and G are positioned. An electric field is ^lied after the second 
vacuum stage to accelerate the ions into the mass spectrometer. As mass analyzer, a 

1 5 quadnipole can be used, though other mass analyzing configurations are known in the art and 
are within the scope of the invention. The design ofthe vacuum interfece of the moving belt 
between the sample application compartment which is at atmospheric pressure, and the mass 
spectrometer can be important. In one approach, this vacuum seal can be provided by the use 
of tunnel seals in a two-stage vacuum lock as previously described [Moini et al, (1?91) , 

20 2U|2Ib]. 

As described above, an increase of throughput can be obtained by 
multiplexing. In yet another embodiment of the invention the moving belt device can be used 
for a second dimension in multiplexing by applying s samples from s sequencing reactors A 
(FIGURE 9) simultaneously in different locations onto the moving belt. Desorption and 

25 ionization of these multiple samples is achieved by moving the laser beam with adjustable 
speed and frequency back and forth perpendicular to the direction of the moving belt. 
Identification and assignment of the nucleotides/nucleosides detected to the various nucleic 
acid fragments can be achieved by second dimension multiplexing, i.e., by mass-labeling the 
nucleic acid fragments with for example .CH2-, -CH2CH2-, -CH2CH2CH2- and - 

30 CH2(CH2)rCH2-, and labeling the different reactors, for example, by individual halogen 
atoms, i.e., F for reactor 1 (thus having the different DNA fragments labeled with -CH2F, • 
CH2CH2F, .CH2CH2CH2F, -CH2(CH2)rCH2F), CI for reactor 2 (thus having the different 
DNA fragments labeled with ^H2C1, .CH2CH2CI, •CH2CH2CH2CI, -CH2(CH2)rCH2Cl), 
Br for reactor 3 etc. This can increase the throughput dramatically. Two-dimensional 

35 multiplexing can be applied in different ways. For example, it can be used to simultaneously 
sequence the fi-agments forming one set of overlapping deletions of the same long DNA 
msert In another embodunent, several subsets of ordered deletions of different DNA inserts 
can be analyzed in parallel. 
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The enormous advantage of exonuclease mediated mass spectrometric DNA 
sequencing is that small molecules are analyzed and identified by mass spectrometry. In this 
mass range, the accuracy of mass speedometers is routinely very high, i.e., 0.1 mass units are 
easily detected. This increases the potential for multiplexing as small differences in mass can 

5 be detected and resolved. An additional advantage of mass spectrometric sequencing is that 
the identified masses can be registered automatically by a computer and by adding the time 
coordinate automatically aligned to sequences. Since the sequences so determined are 
memorized (i.e., saved to disk or resident in the computer memory) appropriate existmg 
computer programs operating in a multitasking environment can be searching in the 

10 "background" (i.e., during continuous getieration of new sequence data by the exonuclease 
mass spectrometric sequencer) for overlaps and generate contiguous sequence infonnation 
which, via a link to a sequence data bank, can be used in homology searches, etc. 

Another aspect of this invention concerns kits for sequencing nucleic acids by 
exonuclease mass spectiometiy, which include combinations of the above described 

15 sequencing reactants. For instance, in one embodiment, the kit comprises reagents for 

multiplex mass spectrometric sequencing of several different species of nucleic acid. The kit 
can include an exonuclease for cleaving the nucleic acids unilaterally from a first end to 
sequentiaUy release individual nucleotides, a set of nucleotides for synthesizmg the different 
species of nucleic adds, at least a portion of the nucleotides bemg mass-modified such that 

20 sequentially released nucleotides of each of tiie different species of nucleic acids are 
distinguishable, a polymerase for synthesizing tiie nucleic acids from complementary 
templates and the set of nucleotides, and a soUd support for immobiUzing one of the nucleic 
acids or ti»e exonuclease. The kit can also include appropriate buffers, as well as instructions 
for performing multiplex mass spectrometry to concurrentiy sequence multiple species of 

25 nucleic acids. In anotiier embodiment, tiie sequencing kit can include an exonuclease for 
cleaving a target nucleic add unilaterally from a first end to sequentiaUy release individual 
nucleotides, a set of nudeotides for syntiiesizing tiie different spedes of nucldc acids, at 
least a portion of tiie nucleotides being mass-modified to modulate tiie exonuclease activity, a 
polymerase for syntiiesizing tiie nucleic add from a complementary template and tiie set of 

30 nucleotides, and a soUd support for unmoWlizing one of tiie nucldc acids or tiie exonudease. 

Anotiier aspect of tins invention concerns a "reverse-Sanger" type sequencing 
metiiod using exonuclease digestion of nucleic acids to produce a ladder of nested digestion 
fragments detectable by mass specttometry. For instance, as above, tiie target nudeic acid 
can be immobilized to a solid support to provide unilateral degradation of tiie chain by 

35 exonuclease action. Incorporating into tiie target nucleic acid a limited number of mass- 
modified nudeotides which inhibit tiie exonuclease activity (i.e., protect an mdividual nucleic 
acid diam from further degradation) can result m a ladder of nested exonuclease fragments. 
See Labeit et al. (1986) DMA^:173; Eckstein et al. (1988) Nnrlpifi Acid Rtt. 16:9947; and 
PCT AppUcation No. GB86/00349. The nested exonuclease fragments can tiien be rdeased 
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from the solid support (i.e., via a cleavable linkage) and the molecular weight values for each 
species of the nested fragments determined by mass spectrometry, as described in U.S. Patent 
application No. 08/001,323. From the molecular weight values detennined, the sequence of 
the nucleic acid can be generated. It is clear that many variations of this reaction are possible 
5 and that it is amenable to multiplexing. For example, the target nucleic acid need not be 
bound to a solid support, rather any protecting group can be used to ensure unilateral 
exonuclease degradation. Where mass-modified nucleotides are used which have large 
enough molecular weight differences to be differentiated between by mass spectrometry (i.e., 
the termmation of a chain with a particular mass-modified nucleotide is discemable from all 

10 other tenninations), the exonuclease sequencing can be carried out to create only one set of 
nested fragments. Alternatively, individual types of exonuclease-inhibiting nucleotides can 
be incorporated in separate reactions to create sets of nested fragments. For instance, four 
sets of nested fragments can be separately generated wherein one set tenninates with mass- 
modified A's, one set terminates in mass-modified G% etc. and the total sequence is 

1 s determined by aligning the collection of nested raonuclease fragments. 

EXAMPLE 1 

TtninnWIigatio Ti of nucleic aridg tn solid supports via disulfide bonds. 

As a solid support, SEQUELON membranes (Millipore Corp., Bedford, MA) 

20 with phenyl isothiocyanate groups is used as a starting material. The membrane disks, with a 
diameter of 8 nun, are wetted with a solution of N-methylmorpholme/watery2-propanol 
(NMM solution) (2/49/49;v/v/v), the excess liquid removed with filterpaper and placed on a 
piece of plastic film or aluminium foil located on a heating block set to 55®C. A solution of 
1 mM 2-mercaptoethylamine (cysteamine) or 2,2'-dithio-bis(ethylamine) (cystamine) or S-(2- 

25 thiopyridyl)-2-thiQ-ethylamme (10 ul, 10 nmol) in NMM is added per disk and heated at 
550c. After 15 min 10 ul of NMM solution are added per disk and heated for another 5 min. 
Excess of isothiocyanate groups may be removed by treatment with 10 ul of a 10 mM 
solution of glycme in NMM solution. In case of cystamine the disks are treated with 10 ul of 
a solution of IM aqueous dithiothreitol (DTT)/2-propanol (1:1, v/v) for 15 min at room 

30 temperature. Then the disks are thoroughly washed in a filtration manifold with 5 aliquots of 
1 ml each of the NMM solution, then with 5 aliquots of 1 ml acetonitrile/water (1/1; v/v) and 
subsequently dried. If not used inmiediately the disks are stored with fi:ee thiol groups in a 
solution of IM aqueous dithiothreitol/2-propanol (1:1; v/v) and, before use, DTT is removed 
by three washings with 1ml each of the NMM solution. Single-stranded nucleic acid 

35 fragments with 5 -SH functionality can be prepared by various methods [e.g. B.C.F Chu et 
al., Nucl eic Acids Res., 14, 5591-5603 (1986), Sproat et al.. Nucleic Acids Res., 15, 4837-48 
(1987) and Oligonucleotides and Analogues. A Practical Approach (F. Eckstein editor), IRL 
Press Oxford, 1991]. The single-stranded nucleic acid fragments with free 5 -thiol group are 
now coupled to the thiolated membrane supports under mild oxidizing conditions. In general 
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it is sufficient to add the 5*-thiolated nucleic acid fragments dissolved in 10 ullO mM de- 
aerated triethylammonium acetate buffer (TEAA) pH 7.2 to the thiolated membrane supports; 
coupling is achieved by drying the saiflples onto the membrane disks with a cold fan. This 
process can be repeated by wetting the membrane with 1 0 ul of 1 0 mM TEAA buffer pH 7.2 
5 and drying as before. When using the.2-thiopyridyI derivatized compounds anchoring can 
be monitored by the release of pyridinerl-thione spectrophotometrically at 343 nm. 

In another variation of this approach the single-stranded nucleic acid is 
fimctionalized with an amino group at the 5'-end by standard procedures. The primary amino 
group is reacted with 3-(2-pyridyldithio)propionic acid N-hydroxysuccinimide ester (SPDP) 

10 and subsequently coupled to the thiolated supports and monitored by the release of pyridyl-2- 
thione as described above. After denaturation of any remaming protein and ethanol 
precipitation of the fimctionalized nucleic acid, the pellet is dissolved in 10 ul 10 mM TEAA 
buffer pH 7.2 and 10 ul of a 2 mM solution of SPDP in 10 mM TEAA are added. The 
reaction mixture is vortexed and incubated for 30 min at 25oC; excess SPDP is then removed 

15 by three extractions (vortexing, cientrifugation) with 50 ul each of ethanol and the resulting 
pellets dissolved in 10 ul 10 mM TEAA buffer pH 7.2 and coupled to the thiolated supports 
(see above). 

The imobilized nucleic acids can be released by three successive treatments 
with 10 ul each of 10 mM 2-mercaptoefhanol in 10 mM TEAA buffer pH 72. 

20 

EXAMPLE2 

Tmmobilimtio n nf nucleic acids on solid sunnort via a levulinvl group. 

5-AminolevuIinic acid is protected at the primary amino group with the Fmoc 
group using 9-fluorenylmethyl N-succinimidyl carbonate and then transformed into the N- 

25 hydroxysuccmimide ester (NHS ester) usmg N-hydroxysuccinimide and dicyclohexyl 

carbodiimide under standard conditions. Nucleic acids which are fimctionalized with primary 
amino acid at the 5' end are EtOH precipitated and resuspended in 10 ul of 10 mM TEAA 
buffer pH 7.2. 10 ul of a 2 mM solution of the Fmoc-5-aminolevulinyl-NHS ester in 10 mM 
TEAA buffer is added, vortexed and incubated at 25^>C for 30 niin. The excess of reagents 

30 can be removed by ethanol precipitation and centrifugation. The Fmoc group is cleaved off 
by resuspending the pellets in 10 ul of a solution of 20% piperidine m N,N- 
dimethylformamide/water (1:1, v/v). After 15 min at 25^0 piperidine is thoroughly removed 
by three precipitations/centrifiigations with 100 ul each of ethanol, the pellets resuspended in 
10 ul of a solution of N-methyhnorpholine, propanol-2 and water (2/10/88; v/v/v) and 

35 coupled to the solid support carrying an isothiocyanate group. IncaseoftheDITC-Sequelon 
membrane (Millipore Corp., Bedford, MA) the membranes are prepared as described in 
EXAMPLE 1 and coupling is achieved on a heating block at 55^0 as described above. The 
procedure can be applied to other solid supports with isothiocyanate groups in a similar 
manner. 
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The immobilized nucleic acids can be released from the solid support by three 
successive treatments with 1 0 ul of 1 00 mM hydrazinium acetate buffer pH 6.5. 

EXAMPLES 

5 Immobilizati o n of nucleic acids on solid <njpnorts via a trvnsin sensitive linkage. 

Sequelon DITC membrane disks of 8 mm diameter (Millipore Corp. Bedford, 
MA) are wetted with 10 ul of NMM solution (N-methylmorpholine/propanaol-2/water; 
2/49/49; v/v/v) and a linker arm introduced by reaction with 10 ul of a 10 mM solution of 1,6- 
diaminohexane in NMM. The excess of the diamine is removed by diree washing steps with 

10 100 ul of NMM solution. Using standard peptide synthesis protocols two L-lysine residues 
are attached by two successive condensations with N -Fmoc-N -tBoc-L-lysine 
pentafluorophenylester, the terminal Fmoc group is removed with piperidine in NMM and the 
free e-amino group coupled to 1 ,4-phenylene diisotfiiocyanate (DITC). Excess DITC is 
removed by three washing steps with 100 ul propanol-2 each and the N -tBoc gfoups 

15 removed with trifluon>acetic acid according to standard peptide synthesis pr^ The 
nucleic acids are prepared from as above 6om a primary amino group at the 5 -teraunus. The 
ethanol precipitated pellets are resuspended in 10 ul of a solution of N*metfaylmorpholine, 
propanol-2 and water (2/10/88; v/v/v) and transfened to the Lys-Lys-DITC membrane disks 
and coupled on aheating block set at SS^C. After drying 10 ul of NMM solution is added 

20 and the dryingprocess repeated. 

The unmobilized nucleic acids can be cleaved from the solid support by 
treatment with trypsin. 

EXAMPLE4 

25 Immohilization of nucleic ac ids on solid supports via PVronhosnhatC linkage, 

The DITC Sequelon membrane (disks of 8 mm diameter) are prepared as 
described in EXAMPLE 3 and 10 ul of a 10 mM solution of 3-aminopyridine adenine 
dinucleotide (APAD) (Sigma) in NMM solution added The excess of APAD is removed by 
a lOulwashofNMM solution and the disks are treated with 10 ul of 10 mM sodium 

30 periodate in NMM solution (15 min, 25<>C). Excess of periodate is removed and the having a 
primary amino group at the S'*end are dissolved in 10 ul of a solution of N- 
methyhnorpholine/propanol-2/water (2/10/88; v/v/v) and coupled to the Z,3'-dialdchydo 
functions of the immobilized NAD analog. 

The ommobilized nucleic acids can be released from the solid support by 

35 treatment with either NADase or pyrophosphatase in 1 0 mM TEAA buffer at pH 7.2 at 37oC 
forlSmin. 
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EXAMPLE 5 

f^ynthi^gig nf pvrimidinft niiclentid e <t mass mndified at C-5 of the heterocyclic hflSC with 
glycine residues. 

Starting material is 5-(3-aminopropynyl-l)-3',5'-di-p-tolyldeoxyuridine 
5 prepared and 3',5'-de-0-acylated according to literature procedures [Haralambidis et al., 
Nucleic Acids Res., 15, 4857-76 (1987)]. 0.281 g (1.0 mmole) 5.(3-aminopropynyl-l)-2'- 
deoxyuridine are reacted with 0.927 g (2.0 mmole) N-Fmoc-glycine pentafluorophenylester 
in 5 ml absolute N,N-dimethylformamide in the presence of 0.129 g (1 mmole; 174 ul) NJ^I- 
diisopropylethylamine for 60 min at room temperature. Solvents are removed by rotary 

10 evaporation and the product purified by silica gel chromatography (Kieselgel 60, Merck; 
column: 2.5x 50 cm, elution with chloroform/methanol mixtures). Yield 0.44 g (0.78 mmole, 
78 %). In order to add another glycine residue the Fmoc groi^ is removed with a 20 min 
treatment with 20 % solution of piperidine in DMF, evtqjorated in vacuo and the remaining 
solid material extracted three times with 20 ml ethylacetate; after having removed the 

15 remaining ethyhicetate N-Fmoc-glycine pentafluorophenylester is being coupled as described 
above. This glycine modified thymidine analogue building block for diemicalDNA 
synthesis can be used to substitute for thymidme or uridine nucleotides in Ae target nucldc 
acid. 

20 EXAMPLE6 

f^yntWQ nf pvrimidine n u rli^fid^s mass mndified at C-5 of the heteTOCYCHc bflSC Wlth B- 
aliinine residues. 

Starting material is the same as in EXAMPLE 5. 0281 g (1.0 mmole) 5-Q- 
Aminopropynyl-l)-2'-deoxyuridine is reacted with N-Fmoc-p-alani^e pentafluorophenylester 

25 (0.955 g, 2.0 mmole) in 5 ml N,N-dimethylforaiamide (DMF) in the presence of 0.129 g (174 
ul; 1 .0 mmole) NJJ-diisopropylethylamine for 60 min at room temperature. Solvents are 
removed and the product purified by silica gel chromatography as described in EXAMPLE 6. 
Yield: 0.425 g (0.74 mmole. 74 %). Another p-ahuiine moiety could be added in exactly the 
same way after removal of the Fmoc group. This building block can be substitute for any of 

30 die tiiymidine or uridine residues in the target nucleic acid. 

EXAMPLE7 

<iYntli>»ci^ nf a pvrimidinft nnclentide m a ss mndified at P-S of the heterocyclic base With 
. ffthvlene glvcnl mmiomethvl ether. 

35 As nucleosidic component 5-(3-aminopropynyl-l>2'-deoxyuridine is used in 

tills example (see EXAMPLE 5 and 6). The mass-modifying functionality is obtained as 
follows: 7.61 g (100.0 mmole) fteshly distilled etiiylene glycol monometiiyl etiier dissolved 
in 50 ml absolute pyridine is reacted witii 10.01 g (100.0 mmole) recrystallized succinic 
anhydride in tiie presence of 1 .22 g (10.0 mmole) 4-N,N-dimethylaminopyridine overnight at 
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room temperature. The reaction is terminated by the addition of water (5.0 ml), the reaction 
mixture evaporated in vacuo, co-evaporated twice with dry toluene (20 ml each) and the 
residue redissolved in 100 ml dichloromethane. The solution is extracted successively, twice 
with 10 % aqueous citric acid (2 x 20 ml) and once with water (20 ml) and the organic phase 

5 dried over anhydrous sodium sulfate. The organic phase is evaporated in vacuo, the residue 
redissolved in 50 ml dichloromethane and precipitated into 500 ml pentane and the 
precipitate dried in vacuo. Yield: 13.12 g (74.0 mmole; 74 %). 8.86 g (50.0 mmole) of 
succinylated ethylene glycol monomethyl ether is dissolved in 100 ml dioxane containing 5 
% dry pyridme (5 ml) and 6.96 g (50.0 mmole) 4-nitrophenol and 10.32 g (50.0 mmole) 

10 dicyclohexylcarbodiimide is added and the reaction run at room temperature for 4 hours. 
Dicyclohexylurea is removed by filtration, the filtrate evaporated in vacuo and the residue 
redissolved in 50 ml anhydrous DMF, 12.5 ml (about 12.5 nunole 4-nitrophenylester) of this 
solution is used to dissolve 2.81 g (10.0 mmole) 5-(3-aminopropynyl-l)-2'-deoxyuridine. 
The reaction is performed in the presence of 1.01 g(10.0 mmole; 1.4ml)triethylamineat 

1 5 room temperature overnight The reaction mixture is evaporated in vacuo, co-ev«q)orated with 
toluene, redissolved in dichloromethane and chromatognq)hed on silicagel (Si60, Merck; 
column 4x50 cm) with dichlorometfaane/melhanol mixtures. The fiactions containing the 
desired compoimd are collected, evaporated, redissolved in 25 ml dichloromethane and 
precipitated into 250 ml pentane. 

20 

EXAMPLES 

Rvnthi^Qi Q nf pvrimidine niiclentides tna^-modififid at T-S of the heterocyclic base with 
diethvlene givcnl monomethvl ether. 

Nucleosidic starting material is as in previous examples, 5-(3-aminopropynyl- 

25 l>2'-deoxyuridine. The niass-modifyingfimctionality is obtained sirnilar to EXAMPLE 7. 
12.02 g (100.0 mmole) fireshly distilled diethylene glycol monomethyl ether dissolved in 50 
ml absolute pyridme is reacted with 10.01 g (100.0 mmole) reciystallized succinic anhydride 
in the presence of 1 .22 g (10.0 mmole) 4-N,N-dimethylaminopyridine (DMAP) overnight at 
room temperature. The work-up is as described in EXAMPLE 7. "Yield: 18.35 g (82.3 

30 mmole, 82.3 %). 1 1.06 g (50.0 mmole) of succinylated die&ylene glycol monomethyl ether 
is transformed into the 4-nitrophenylester and subsequently 12.5 mmole reacted with 2.81 g 
(10.0 mmole) of 5-(3-anunopropynyH)-2'-deoxyuridine as described in EXAMPLE 7. Yield 
after silica gel colunm chromatography and precipitation into pentane: 3.34 g (6.9 mmole, 69 
%). 



35 
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EXAMPLE 9 

{^vnth^cis nf rienxvadenn^ine mass-m o HifipH at P.-R of the heterocyclic hflSP with glVCine . 

Starting material is N6-ben2oyl-8-bromo-5'-0-(4,4'-diinethoxytrityl)-2'- 
deoxya denosine prepared according to literature [Singh et al., Nucleic Acids Res. 1 8, 3339- 
5 45 (1990)]. 632.5 mg (1.0 mmole) of this 8-bromo-deoxyadenosine derivative is suspended 
in 5 ml absolute ethanol and reacted with 251.2 mg (2.0 mmole) glycine methyl ester 
(hydrochloride) in the presence of 241.4 mg (2.1 mmole; 366 ul) N,N-diisopropylethylamine 
and refluxed until the starting nucleosidic material has disappeared (4-6 hours) as checked by 
thin layer chromatography (TLC). The solvent is evaporated and the residue purified by 
10 silica gel chromatography (column 2.5x50 cm) using solvent mixtures of 

chloroform/methanol containing 0.1 % pyridine. The product fiactions are combined, the 
solvent evaporated, dissolved in 5 ml dichloromethane and precipitated into 100 ml pentane. 
Yield: 487 mg (0.76 mmole, 76 %). 

IS EXAMPLE 10 

jgynthpdf: nf denYVfldgnngine mflss-m nHififtri at T-R nf thft heteror.vr.lic hase with 

fivrvlfrlveine. 

This derivative is prepaiwi in analogy to the glycine derivative of EXAMPLE 
9. 632.5 mg (1.0 mmole) N6-Benzoyl-8-bromo-5'-0-(4,4'-dimethoxytrityl)-2'-deoxy 
20 adenosine is suspended in 5 ml absolute ethanol and reacted with 324.3 mg (2.0 mmole) 
glycyl-glycine methyl ester in the presence of 241 .4 mg (2.1 mmole, 366 ul) N,N- 
diisopropylethylamine. The mixture is refluxed and completeness of the reaction checked by 
TLC. Work-up and purification is similar as described m EXAMPLE 9. Yield after silica gel 
column chromatography and precipitation into pentane: 464 mg (0.65 mmole, 65 %). 

25 

EXAMPLE 11 

<^Ynth>"!ig of riftOYVthymiriine mass- m nHifif><4 at the T-?' of the SUear mniBtV With ethvlmC 
plvr.nl mon6pn<^thv1 ether residues. 

Starting material is 5'-O-(4,4-dimethoxytrity0-2'-amino-2'-deoxythyimdine 
30 synthesized accoiding to published procedures [e.g. VeAeyden et al., J. Org. Chem., 36, 250- 
254 (1971); Sasaki et al., J. Org. Chem., 41, 3138-3143 (1976); Imazawa et al.. J. Org. 
Chem., 44, 2039-2041 (1979); Hobbs et al.. J.. Org. Chem., 42, 714-719 (1976); Ikehara et al., 
Chem. Phann. Bull. Japan, 26, 240-244 (1978); see also PCT Application WO 88/00201]. 5'- 
0-(4,4-Dimethoxytrityl>2'-amino-2'-deoxythymidine (559.62 mg; 1.0 mmole) is reacted with 
35 2.0 mmole of the 4-nitrophenyl ester of sucdnylated ethylene glycol monomethyl ether (see 
EXAMPLE 7) in 1 0 ml dry DMF in the presence of 1 .0 mmole (140 ul) triethylamine for 1 8 
hours at room temperature. The reaction mixture is evaporated in vacuo, co-evaporated vrith 
toluene, redissolved in dichloromethane and purified by silica gel chromatography (Si60, 
Merck, column: 2.5x50 cm; eluent: chloroform/methanol mixtures containing 0.1 % 
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triethylamine). The product containing fractions are combined, evaporated and precipitated 
into pentane. Yield: 524 mg (0.73 mmol; 73 %). 

In an analogous way, employing the 4-nitrophenyl ester of succinylated 
diethylene glycol monomethyl ether (see EXAMPLE 8) and triethylene glycol monomethyl 
5 ether the corresponding mass modified deoxythymidine is prepared. The mass difference 
between the ethylene, diethylene and trietfiylene glycol derivatives is 44.05, 88.1 and 132.15 
dalton respectively. 

EXAMPLE 12 

10 Synthesis of denxvuridine>5'-triphospha te mass-modified at C>S of the heterocvclic base with 
glycine. glvcvUglvc ine and fi-alanine residues. 

0.281 g (1.0 mmole) 5-(3-AminopropynyH)-2'-deoxyuridine (see EXAMPLE 
5) is reacted with either 0.927 g (2.0 mmole) N-Fmoc-glycine pentafluorophenylester or 
0.955g (2.0 mmole) N-Fmoc-p-alanine pentafluorophenyl ester in 5 ml dry DMF in the 

15 presence of 0.129 g N,N-diisopropylethylamine (174 ul, 1.0 mmole) overnight at room 
temperature. Solvents are removed by evaporation in vacuo and the condensation products 
purified by flash chromatography on silica gel [Still et al., J. Org. Chem., 43, 2923-2925 
(1 978)]. Yields: 476 mg (0.85 nunole: 85 %) for the glycine and 436 mg (0.76 mmole; 76 %) 
for the -alanine derivative. For the synthesis of the glycyl-glycine derivative the Fmoc group 

20 of 1 .0 mmole Fmoc-glycine-deoxyuridine derivative is removed by one-hour treatment with 
20 % piperidine in DMF at room temperature. Solvents are removed by evaporation in 
vacuo» the residue is co-evaporated twice with toluene and condensed with 0.927 g (2.0 
mmole) N-Fmoc-glycine pentafluorophenyl ester and purified following standard protocol. 
Yield: 445 mg (0.72 mmole; 72 %). The glycyl-, glycyl-glycyl- and p.alanyl.2'-deoxyuridine 

25 derivatives N-protected with the Fmoc group are now transformed to the 3 -0-acetyl 

derivatives by tritylation with 4,4-dimethoxytrityl chloride in pyridine and acetylation with 
acetic anhydride in pyridine in a one-pot reaction and subsequently detritylated by one-hour 
treatment with 80 % aqueous acetic acid according to standard procedures. Solvents are 
removed, the residues dissolved in 1 00 ml chloroform and extracted twice with 50 ml 10 % 

30 sodium bicarbonate and once with 50 ml water, dried with sodium sul&te, the solvent 

evaporated and the residues purified by flash chromatogr^hy on silica gel. Yields: 361 mg 
(0.60 mmole; 71 %) for the glycyl-, 351 mg (0.57 mmole; 75 %) for the -alanyl- and 323 mg 
(0.49 mmole; 68 %) for the glycyl-glycyl-3-0'-acetyl-2'-deoxyuridme derivatives 
respectively. Phosphorylation at flie 5'-0H with POCI3, transformation into the 5 - 

35 triphosphate by in-situ reaction with tetra(tri-n-butylammonium) pyrophosphate m DMF, 3'- 
de-O-acetylation and cleavage of the Fmoc group and final purification by anion-exchange 
chromatography on DEAE-Sephadex is perforaied. Yields according to UV-absorbance of 
the uracil moiety: 5-(3-(N-glycyl)-amidopropynyl-l)-2*-deoxyuridine-5 -trip hosphate 0.41 
mmole (84 %), 5-(3-(N— alanyl)-amidopropynyl-l)-2'-deoxyuridine-5*-tr iphosphate 0.43 



PCT/US94/02938 

WO 94/21822 

-28- 

mmole (75 %) and 5-(3-(N-glycyl-glycyl)-ainidopropynyl-l)-2'-deoxyuridine- 5'-triphosphate 
0.38 mmole (78 %). 

EXAMPLE 13 

5 Rvnthpsis of R-pivrvi. anH R.p lvr.vl-plvcvl-2'-deoyva(1enosine-!)'-tTiphoSDhate . 

727 mg (1.0 mmole) of N6-(4-tert.butylphenoxyacetyl ).8-glycyl-5'-(4,4. 
dimethoxytrityl)-2'- deoxyadenosine or 800 mg (1.0 mmole) N6-(4-tert.butylphenoxyacetyl)- 
8-glycyl-glycyl-5'-(4,4-dimethoxytrityl)-2'-deoxyadenosine prepared according to 
EXAMPLES 9 and 10 and literature [Koster et al., Tetrahsdron. 22 : 362 (1981)] are 

10 acetylated with acetic anhydride in pyridine at the 3'-0H, detritylated at the S'-position with 
80 % acetic acid in a one-pot reaction and transformed into the 5'-triphosphates via 
phosphorylation with POCI3 and reaction in-situ with telra(tri-n-butylammonium) 
pyrophosphate. Deprotection of the N6-tert-butylphenpxyacetyl, the S'-O-acetyl and the 0- 
methyl group at the glycine residues is achieved with concentrated aqueous ammonia for 

15 three hours at room temperature. Anunonia is removed by lyophillization and the residue 
washed with dichloiomethane, solvent removed by evaporation in vacuo and the remaining 
soUd material purified by anion-exchange chromatography on DEAE-Sephadex using a linear 
gradient of triethylammonium bicarbonate from 0.1 to 1.0 M. The nucleoside triphosphate 
containing fractions (checked by TLC on polyethyleneimine cellulose plates) are combined 

20 and lyophillized. Yield of the 8-glycyl-2'-deoxyadenosine.5'-triphosphate (determined by the 
UV-absorbance of the adenine moiety) is 57 % (0.57 mmole); the yield for the 8-glycyl- 
glycyl-2'-deoxyadenosine-5'-triphosphate is 51 % (0.51 mmole). 

All of the above-cited references and publications are hereby incorporated by 

25 reference. 



Equivalents 

Those skilled in the art will recognize, or be able to ascertam using no more 
30 than routine experimentation, numerous equivalents to the specific procedures described 
herein. Such equivalents are considered to be within the scrope of this invention and are 
covered by the following claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: 

(A) NAME: KOSTER, HUBERT 

(B) STREET: 1640 MONUT^NT STREET 

(C) CITY: CONCORD 

10 (D) STATE: MASSACHUSETTS 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 01742 

(G) TELEPHONE: (508) 369-9790 

15 (ii) TITLE OF INVENTION: DNA SEQUENCING BY MASS SPECTROMETRY 

(iii) NUMBER OF SEQUENCES: 6 

(v) COMPUTER READABLE FORM: 
20 (A) MEDIUM TYPE: Floppy disJc 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: ASCII text 

25 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US OB/034,738 

(B) FILING DATE: 19 March 1993 

(C) CLASSIFICATION: 

30 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/034,738 

(B) FILING DATE: 19-MAR-1993 

(C) CLASSIFICATION: 

35 (viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: DeConti, Giulio A. 

(B) REGISTRATION NUMBER: 31,503 

(C) REFERENCE/DOCKET NUMBER: HKI-005PC 

40 (ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 227-7400 

(B) TELEFAX: (617) 227-5941 



45 (2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



55 



(iii) HYPOTHETICAL: YES 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:!: 

10 

GCCTTAGCTA 
5 (2) INFORMATION FOR SEQ ID N0:2j 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other hucleic acid 
15 (iii) HYPOTHETICAL: YES 



20 



30 



35 



40 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2! 
GCGGCCGCAG 6TCA 

12) INFORMATION FOR SEQ ID NO: 3: 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEX^SS: Single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
AATTCAGCG6 CCGC 

(2) INFORMATION FOR SEQ ID N0:4: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: YES 



14. 



14 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
GGCCGCAGGT CA 

5 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(ii) MOLECULE TYPE: CDNA 
(iii) HYPOIHETICAL: YES 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

G6CC6CTGAA TT 



25 (2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 
35 (iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION:. SEQ ID NO: 6: 

40 

GCTAACTTGC 



10 
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Claiims 

1 . A method of determining a sequence of a nucleic acid, comprising 

(i) isolating the nucleic acid to be sequenced; 

(ii) cleaving the nucleic acid unilaterally from a first end with an 
5 exonuclease activity to sequentially release individual nucleotides; 

(iii) identifying each of the sequentially release nucleotides by mass 

spectrometry; and 

(iv) determining the sequence of the nucleic acid from the identified 

nucleotides. 

10 

2. The method of according to claim 1 , wherein the nucleic acid is a 2 - 
deoxyribonucleic acid (DNA). 

3 . The method of according to claim 1 , wherein the nucleic acid is a ribonucleic 
15 acid (RNA), 

4. The method of according to claim 1 , wherein the exonuclease is selected from 
a group consisting of snake venom phosphodiesterase, spleen phosphodiesterase, Bal-31 
nuclease. E. coli exonuclease I, E. coli exonuclease VII, Mung Bean Nuclease, SI Nuclease, 

20 an exonuclease activity of E. coli DNA polymerase I, an exonuclease activity of a Klenow 
fragment of DNA polymerase I, an exonuclease activity of T4 DNA polymerase, an 
exonuclease activity of T7 DNA polymerase, an exonuclease activity of Taq DNA 
polymerase, an exonuclease activity of Deep Vent DNA polymerase, and an exonuclease 
activity of Ventf DNA polymerase. 

25 

5. The method according to claim 1, wherein the exonuclease is immobilized by 
covalent attachment to a solid support, entrapment within a gel matrix, or contained in a 
reactor with a semipermeable membrane. 

30 6. The method accordmg to claim 5, wherein tiie solid support is a capillary and 

the exonuclease activity is colaventiy attached to an inner wall of the capillary. 

7. The method according to claim 5, wherein the solid siq)port is selected from a 

group consisting of glass beads, cellulose beads, polystyrene beads, Sephadex beads, 
35 Sepharose beads, polyaciylamide beads and agarose beads. 



8. The mefliod according to claim 5, wherein the solid support is a flat 

membrane. 
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9. The method according to claim 1, wherein the nucleic acid is immobilized by 

covalent attachment to a solid support and the exonuclease is in a solution contacted with the 
immoblized polynucleotide. 

5 10. The method according to claim 9, wherein the solid support is a capillary and 

the nucleic acid is colavently attached to. an inner wall of the capillary. 

11. The method according to claim 9, wherein the solid support is selected from a 
group consisting of glass beads, cellulose beads, polystyrene beads, Sephadex beads, 

1 0 Sepharose beads, polyacrylamide beads and agarose beads. 

1 2. The method according to claim 9, wherein the solid support is a flat 
membrane. 

15 13, The method according to claim 1 , wherein the nucleic acid comprises mass- 

modified nucleotides. 

1 4. The method according to claim 1 3, wherein the mass-modified nucleotides 
modulate the rate of the exonuclease activity. 

20 

15. The method according to claim 1 , v^erein the sequentially released 
nucleotides are mass-modified subsequmt to exonuclease release and prior to mass 
spectrometric identification. 

25 1 6. The method according to claim 1 5, wherein the sequentially released 

nucleotides are mass-modified by contact with an alkaline phosphatase. 

1 7. The method according to claim 1, wherein i different species of nucleic acids 

are concurrently sequenced by multiplex mass spectrometric sequencing and the sequentially 
30 released nucleotides fipm each species of the i nucleic acids can be distinguished by mass 
spectrometry fit)m the sequentially released nucleotides from the remaining i-1 nucleic acids 
based on a difference in mass due to mass-modification of at least a portion of the 
sequentially released nucleotides. 

35 ig. The method according to claim 17, herein the mass-modified nucleotides 

further serve to modulate activity of the exonuclease. 
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19. The method according to claim 1 7, wherein the mass-modified nucleotide 

comprises a mass-modifying functionality (M) attached to the sequentially released 
nucleotide. 

5 20. The method according to claim 20, wherein at least one of the mass-modified 

nucleotides is modified with a mass modifying fimctionality (M) attached to a 5' phosphate. 

21. The method according to claim 20, wherein at least one of the mass-modified 
nucleotides is modified with a mass modifying fimctionality (M) attached to a C-2' position 

10 of a sugar moiety. 

22. The method accoiding to claim 20, wherein at least one of the mass-modified 
nucleotides is modified with a mass modifying fimctionality (M) attached to a heterocyclic 
base. 

15 

23. The method accoiding to claim 22, wherein the mass modified nucleotide 
comprises amodified heterocycUc base selected from the group consisting of a cytosine 
moiety modified at C-5, an uracU moiety modified at C-5, a thymine moiety modified at the 
C-5 methyl group, an adenine moiety modified at C-8, a c7-deazaadenine moiety modified at 

20 C-8, a c7-deazaadenine moiety modified at C-7, a guanine moiety modified at C-8, a c7- 
deazaguanine moiety modified at C-8, a c7.dea2aguanine moiety modified at C-7, a 
hypoxanthine moiety modified at C-8, a c7-deazahypoxanthine moiety modified at C-8 and a 
c7-deazahypoxanthine moiety modified at C-7. 

25 24. The method according to claim 20, wherein the mass modifying fimctionality 

(M) is selected from a group consisting of XR, F, CI, Br, I, Si(CH3)3, Si(CH3)2( C2H5). 
Si(CH3)(C2H5)2, Si(C2H5)3, CH2F, CHF2. and CF3, wherein X is selected fiwm a group 
consisting of -OH, -NH2. -NHR. -SH, -NCS, .0C0(CH2) rCOOH (where r - 1-20), - 
NHC0(CH2)rC00H (where r = 1-20), -OSO2OH, .0C0(CH2)rI (wliere r - 1-20), and - 

30 0P(0-Alkyl)N(Alkyl)2, and R is selected fiwn a gro\q> consisting of H, methyl, e&yl, 

propyl, isopropyl. t-butyl, hexyl, benzyl, benzhydryl. trityl, substituted trityl, aiyl, substituted 
aryl, polyoxymethylene, monoalkylated polyoxymethylene, a polyethylene imine, a 
polyamide having a general formula [-NH(CH2)r NHC0(CH2)rC0-]m, polyamides of 
general formula [-NH(CH2)r CO-lm. polyesters of general formula [-0(CH2)rC0-]m . 

35 alkylated sUyl compounds of general formula -Si(Y)3, hetero-oligo/polyaminoacids of die 
general foraiula (-NHCHaaCO-)m, a polyethylene glycol of the general formula - 
(CH2CH20)ni-CH2CH20H, and a monoalkylated polyethylene glycol of the general 
formula -(CH2CH20)m-CH2CH20-Y, where ra is in the range of 0 to 200, Y is a lower 
alkyl group selected from a group consisting of methyl, ethyl, propyl, isopropyl, t-butyl, 
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hexyl, r is in the range of 1 to 20, and aa represents any amino acid side chain of naturally 
occurring amino acids. 

25. The method according to claim 13, wherein the mass-modified nucleic acid is 
5 prepared fh}m a single-stranded template polynucleotide complementary to the nucleic acid 

sequence by synthesizing the nucleic acid using mass-modified nucleotides. 

26. The method according to claim 25, wherein the mass-modified nucleic acid is 
synthesized using a primer having a sequence which allows the nucleic acid to be anchored 

10 to the solid support. 

27. The method according to claim 25, wherein the mass-modified nucleic acid is 
synthesized using a DNA polymerase and mass-modified deoxyribonucleoside triphosphates 
(dNTPs). 

15 

28. The method according to claim 25, wherein the mass-modified nucleic acid is 
synthesized using an RN A polymerase and mass modified libonucleoside triphosphates 
(NTPs), 

20 29. The method accordmg to claun 25, wherein synthesis of the nucleic acid is 

initiated in the presence of an initiator oligonucleotide having a 5 -fimctionality allowing the 
nucleic acid to be immobilize on the solid support 

30. The method according to claim 9, wherein the nucleic acid finther comprises a 
25 linking groiq> (L) for covalently attaching the nucleic acid to the solid support 

3 1 . The method according to claim 30, wherein the solid support fiirther 
comprises a splint oligonucleotide and the linking group (L) comprises a nucleotide sequence 
able to anneal to the splint oligonucleotide and be covalently attached to the solid support by 

30 action of a ligase activity. 
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32. A system for exonuclease-mediated mass spectrometric sequencing 
comprising 

(i) reactor means for containing exonuclease cleavage reactions of the 
target nucleic acids, the cleavage reaction forming a train of sequentially released individual 

5 nucleotides; 

(ii) mass spectrometric means for detecting the train of individual 
nucleotides released by the exonuclease cleavage of the target nucleic acid; and 

(iii) transfer means for transferring the train of individual nucleotides from 
the reactor means to the mass spectrometric means. 

10 

33 . The system of claim 32, wherein the transfer means further comprises a 
moving belt. 

34. The system according to claim 32, wherein the reactor means further 

15 comprises a reactor with a cooling/heatmg mantle for controlling the temperature of the 
exonuclease cleavage reaction, at least one reagent flask for providing at least one reagent to 
the reactor, at least one pumping device for pumpmg the reagents and leactants to and from 
the reactor, at least one heatmg/cooUng coil for controlling the temperature of reagents 
provided to the reactor, and means for immobilizing one of the target nucleic acids or the 

20 exonuclease in the reactor. 

35. The system according to claim 34, wherein the reactor means further 
comprises at least one secondary reactor for mass-modifying the mdividual nucleotides 
released by the exonuclease cleavage of the target nucleic acids. 

25 

36. The system according to claim 35, wherein the secondary reactor comprises 
inmiobilized alkaline phosphatase. 

37. The system accordmg to claim 33, m which the moving beh comprises an 
30 endless moving belt driven by a stepping motor and held under tension by spring-loaded 

pulleys, a sample iq)plication area for applying a sample of the individual nucleotides 
released by the exonuclease cleavage of the target nucleic acids to a portion of the movmg 
belt, a cooling/heating plate positioned beneath the sample application area for controUmg the 
temperature of the sample application area, a CCD camera with optics for viewing the sample 
35 application area, a source for ionization/desoiption capable of moving across the surface of 
the belt in a dhection perpendicular to the motion of the moving belt, differential vacuum 
pumping stages for generating a vacuum over a protion of the moving belt, and a heating 
device to facilitate removal of sample residue from the moving belt. 
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38, The system accowling to claim 37, fiirther comprising a washing station to 
wash the moving belt with a cleaning solution prior to application of the sample, wherein the 
heating device facilitates drying the belt after rinsing with the cleaning solution. 

39, The system of claim 37, v»*erein the heating device is a microwave. 

40, The system according to claim 37, in which the ionization/desorption source is 
a laser. 



10 41. 



The system according to claim 32, further comprising a plurality of reactor 



means for parallel sequencing of a plurality of nucleic acids. 

42. The system of claim 32, further comprising a microprocessor for processing 
the detected train of released nucleotides and determining the sequence of the target nucleic 

15 acids. 

43. A kit for exonuclease sequencing at least two different species of nucleic acids 

by mass spectrometry, comprising 

0) an exonuclease for cleaving the nucleic acids unilaterally from a first 

20 end to sequentially release individual nucleotides; 

(ii) a set of nucleotides for synthesizing the different species of nucleic 
acids, at least a portion of the nucleotides being mass-modified such that sequentially 
released nucleotides of each of the different species of nucleic acids are distinguishable, 

(iii) a polymerase for synthesizing the nucleic acids fix>m complementary 
25 templates and the set of nucleotides, and 

(iv) a solid support for immobilizing one of the nucleic acids or the 

exonuclease. 

44. The kit ofaccording to claim 43, wherein the nucleic adds are 2- 
30 deoxyribonucleic acids (DNA). 

45. The kit ofaccording to claim43,v*erein the nucleic acids are ribonucleic 
acids (RNA). 

35 46. The kit of according to claim 43, wherein the exonuclease is selected fiom a 

group consisting of snake venom phosphodiesterase, spleen phosphodiesterase, Bal-31 
nuclease, E. coU exonuclease I, E. coli exonuclease VII, Mung Bean Nuclease, SI Nuclease, 
an exonuclease activity of E. coU DNA polymerase I, an exonuclease activity of a Klenow 
fiaginent of DNA polymerase I, an exonuclease activity of T4 DNA polymerase, an 
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exonuclease activity of T7 DNA polymerase, an exonuciease activity of Taq DNA 
polymerase, an exonuclease activity of Deep Vent DNA polymerase, and an exonuclease 
activity of Ventf DNA polymerase. 

5 47. The kit according to claim 43, wherein the solid support is selected from a 

group consisting of a capillary, a flat membrane, glass beads, cellulose beads, polystyrene 
beads! Sephadex beads, Sepharose beads, polyacrylamide beads and agarose beads. 

48. The kit according to claim 43, wherein die solid support is functionalized to 
1 0 facilitate covalent attachment of the nucleic acids. 

49. The kit according to claim 43, wherein the exonuclease is immobilized by 
covalent attachment to the solid support. 

, 5 50. The kit according to claim 43, wherein the mass-modified nucleotide 

comprises a mass-modifying functionality (M) attached to a nucleotide moiety. 

51 . The method according to cUim 50, wherein the mass-modifying functionality 
(M) is attached to a nucleotide moiety at a position selected firom a group consisting of a 5' 

20 phosphate, a C-2' position of a sugar moiety, and a heterocyclic base. 

52. The method according to claun 50, wherein the mass modifying functionality 
(M) is selected from a group consisting of XR, F, CI, Br, I, Si(CH3)3, Si(CH3)2( C2H5), 
Si(CH3)(C2H5)2, Si(C2H5)3. CH2F, CHF2. and CF3. wherem X is selected from a group 

25 consisting of -OH. -NH2, -NHR, -SH. -NCS. -0C0(CH2) rCOOH (where r = 1-20). - 
NHC0(CH2)rC00H (where r= 1-20). -OSO2OH, -0C0(CH2)rI (where r« 1-20), and - 
0P(0-Alkyl)N(Alkyl)2, and R is selected from a group consisting of H, methyl, etl^l. 
propyl, isopiopyl, t-butyl, hexyl, benzyl, benzhydryl, trityl, substituted trityl, aiyl, substituted 
aryl, polyoxymethylene, monoalkylated polyoxymethylene, a polyethylene imine, a 

30 polyamide having a general formula [-NH(CH2)r NHC0(CH2)rC0-]m, polyamides of 
general formula [-NH(CH2)r CO-Jm. polyesters of general formula [-0(CH2)rCO]m . 
alkylated silyl compounds of general formula -Si(Y)3, hetero-oUgo/polyaminoacids of the 
general formula (.NHCHaaCO-)m. a polyethylene glycol of the general formula - 
(CH2CH20)m-CH2CH20H, and a monoalkylated polyethylene glycol of the general 

35 fonnula-(CH2CH20)m-CH2CH20-Y, where mis in the range ofO to 200, Visa lower 

alkyl group selected from a group consisting of methyl, ethyl, propyl, isopropyl, t-butyl, 
hexyl, r is in the range of 1 to 20, and aa represents any amino acid side chain of naturally 
occurring amino acids. 
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53 . The kit according to claim 43, further comprising an instruction manual 
providing the endonuclease sequencing protocol. 

54. A kit for sequencing nucleic acid by exonuclease-mediated mass spectrometry, 
5 comprising 

(i) an exonuclease for cleaving the nucleic acid unilaterally from a first 
end to sequentially release individual nucleotides; 

(ii) a set of nucleotides for synthesizing the nucleic acid, at least a portion 
of the nucleotides being mass-modified to modulate cleavage activity of the exonuclease, 

,Q (iii) a polymerase for synthesizing the nucleic acids from complementary 

templates and the set of nucleotides, and 

(iv) a solid support for immobilizing one of the nucleic acids or the 

exonuclease. 

,5 55. A method ofdetermining a sequence ofa nucleic acid, comprising 

(i) isolating the nucleic acid to be sequenced; 

(ii) cleaving the nucleic acid unilaterally from a first end with an 
exonuclease activity to produce a set of nested nucleic acid fragments; 

(iii) detennining the molecular weight value of each one of the set of 
20 nucleic adid fragments by mass spectiometiy; and 

(iv) detenninmg the sequence of the nucleic acid from the molecular 
weight values of the set of nucleic acid fragments. 
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